METHODS FOR PREPARING SYNTHETIC REPETITIVE DNA

The government has certain rights in this invention as a result of support 
provided by the Department of the Navy for the work leading to the present 
invention. 

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of Application Serial No. 
08/175,155, filed December 29, 1993, which application is a continuation-in-part of 
Application Serial No. 08/053,049, which application through intervening 
applications, is a continuation-in-part of Application Serial No. 927,258, filed 
November 4, 1986, now abandoned.

INTRODUCTION 

Technical Field 

The field is related to the production of high-molecular-weight polymers, 
either nucleic acids or peptides that are the expression products of the nucleic acids,
and is particularly related to the production of high-molecular-weight peptides 
containing repeating sequences by biochemical processes, the peptides finding use as 
structural materials. 

Background

Recombinant DNA technology has been applied in the isolation of natural 
genes and the expression of these genes in a variety of host cells. Typically, this 
technology has had utility in producing biologically active polypeptides, such as
interferons or peptide hormones, which were impractical to produce in useful
amounts by other means. It was also possible to produce modified proteins by 
isolating natural genes and utilizing the techniques of site specific, in vitro 
mutagenesis to alter these genes and thereby change the polypeptides produced. 
Other polypeptides have been created by combining sections of various native genes
to produce new polypeptides that are chimeric molecules of the several naturally 
occurring molecules. 

With the advent of efficient and automated methods for the chemical 
synthesis of DNA, it has become possible to synthesize entire genes and to modify
such synthetic genes at will during the course of synthesis. However, these various
technologies have been applied to the production of natural or modified versions of 
natural polypeptides. There have been very few attempts to use these technologies 
to create substantially new polypeptides. In nature, polypeptides have a wide range 
of chemical, physical and physiological characteristics. Nevertheless, there are
commercial applications for which known, naturally occurring polypeptides are not
appropriate. 

While biotechnology is versatile, usually it has been limited in its 
applications to naturally occurring products or modifications of naturally occurring 
molecules. One great strength of organic chemical synthesis, by contrast, has been 
the ability to transform inexpensive carbon materials to a wide variety of polymeric
molecules, including naturally occurring molecules, but most importantly entirely
new chemical structures, such as polypropylene and polyacrylates, which have
defined and predicted chemical properties not associated with naturally occurring 
molecules. 

Such materials, particularly high-molecular-weight polymers containing
repeating sequences of amino acids, have proven difficult to produce by biochemical
means. The genes necessary for producing large peptides containing repeating units 
of amino acids were unstable and often underwent intermolecular recombination 
causing deletions of repeating units in the gene. The development of a
biotechnology which would produce polymeric molecules by biological processes
similar to those available by organic synthesis would significantly broaden the range 
of applications of biotechnology.

Brief Description of the Relevant Literature

The cloning of multiple lactose operators up to four in tandem is disclosed by 
Sadler et al., Gene (1980) 8:279-300. Hybrid bacterial plasmids containing highly 
repeated satellite DNA are disclosed by Brutlag et al., Cell (1977) 10:509-519. The
synthesis of a poly(aspartyl-phenylalanine) in bacteria is disclosed by Doel et al.,
Nucleic Acids Research (1980) 8:4575-4592. A method for enriching for proline 
content by cloning a plasmid which codes for the production of a proline polymer 
was disclosed by Kangas et al., Applied and Environmental Microbiology (1982) 
43:629-635. The biological limitations on the length of highly repetitive DNA
sequences that may be stably maintained within plasmid replicons are discussed by
Gupta et al. in Bio/Technology, p. 602-609, September 1983. 

SUMMARY OF THE INVENTION 
Methods are provided for the production of protein polymers having
extended stretches of small repeating units by expression of a synthetic gene. The
amino acid repeating units are sequences that provide a motif for the protein 
polymer and comprise a major portion of the gene encoding the protein polymer. 
There may be more than one type of amino acid repeating unit in a single protein 
polymer. According to the design of the protein polymer, one or more different
amino acid repeating units and, optionally, one or more amino acid interrupting
linker or spacer sequences are organized into a "monomer". In the final protein 
polymer, the amino acid monomer is sequentially replicated to achieve the desired 
molecular weight. 

To construct the gene encoding the protein polymer, a DNA monomer
sequence encoding the amino acid monomer sequence is first designed and
synthesized. There are three different approaches to synthesizing the DNA 
monomer: (1) synthesizing a plurality of dsDNA segments, which when ligated 
either in conjunction with their synthesis or after cloning and subsequent restriction 
enzyme digestion correspond to the desired DNA monomer sequence. Each dsDNA
segment will typically encode a few amino acid repeating units, although the
segment may encode an amino acid interrupting linker or spacer sequence. The 
dsDNA segments are synthesized by synthesizing single stranded oligomers which at
least partially overlap and hybridizing pairs of oligomers to provide dsDNA. The
dsDNA monomer is then assembled by independently cloning each dsDNA segment 
in a cloning vector and then sequentially inserting into a cloning vector containing 
all or a portion of a first segment all or a portion of each additional dsDNA segment
by restriction enzyme digestion and ligation, by simultaneously cloning in a cloning
vector all of the dsDNA segments where each individual segment has a 3' or 5' 
terminus complementary to the 5' or 3' terminus of a second segment, and so forth, 
or by combining convenient elements of these methods whereby a monomer is 
obtained with an open reading frame with the proper sequence, which monomer is
sequenced; or (2) synthesizing a single strand of all or a portion of the DNA
monomer and making the complementary strand, conveniently using the polymerase
chain reaction (PCR) in which case the synthesized PCR template comprises the 
appropriate sequence of the ssDNA monomer flanked by appropriate primer binding 
site sequences. The resulting dsDNA segment(s) are then digested by restriction
enzyme digestion to remove the primer binding sites, cloned in a cloning vector, and
if more than one segment has been synthesized, assembled in a cloning vector as 
described above to form the monomer, which is sequenced; or (3) using the 
appropriate restriction enzymes, deleting a portion of a DNA monomer or 
combining all or part of DNA monomers previously prepared as in (1) and/or (2)
above, optionally including new dsDNA segments synthesized as above, and
characterizing the monomer. Generally, the dsDNA segments will be sequenced
after each cloning step. dsDNA segments prepared as in (1) and/or (2) above can be
combined to form a monomer, which is sequenced. In those cases where new DNA 
is synthesized and introduced into a monomer, at least the new DNA, and usually the
entire monomer, will be sequenced.

The monomers have predetermined termini for oligomerization. The
monomer is then concatenated or oligomerized under ligating conditions to form 
multimers of the monomer, where the multimers may have different numbers of 
monomers resulting in a plurality of genes having different numbers of monomers.
At least one multimer is inserted into an expression vector for introducing the vector
into an appropriate expression host for expression of the gene. The expression host
is then grown under conditions whereby the protein is expressed and may be
isolated. 



BRIEF DESCRIPTION OF THE FIGURES
Fig. 1: Plasmid pSY701 structure.

Fig. 2A-B: Immunoblots of polypeptide products using antibody to (a) beta- 
lactamase or to (b) gly-ala-peptide. 

Fig. 3: Construction flowchart for plasmid pG10/SlpI.
Fig. 4A-B: Immunoblots of polypeptide products (a) T7gp10/SlpI with anti-Slp Ab,
(b) T7gp9/SlpI with anti-Slp Ab or (c) staining with Coomassie blue.
Fig. 5: Construction flowchart for plasmid pSY856. 
Fig. 6: Time course for accumulation of the kanamycin-resistance gene 
product with the T7 system. 

Fig. 7: Construction flowchart for plasmid pSY857. 
Fig. 8: Construction flowchart for plasmid pSY980.

Fig. 9A-B: (a) Amido black stain of gel containing the product of beta-
galactosidase/SlpIII gene fusion; (b) immunoblot of same product with anti-Slp
antibody.

Fig. 10: Construction flowchart for plasmid pSY1280. 


DESCRIPTION OF SPECIFIC EMBODIMENTS 
Novel polypeptides are provided which are block polymers of repeating, 
relatively short, amino acid sequence units. The blocks of repeating units 
(oligomers) may be linked by spacers of different amino acid sequences. The
polypeptides may contain only one or a plurality of repetitive amino acid sequences
(having the same or different amino acid sequences). The novel polypeptides are 
particularly useful as fibrous or structural proteins, including crystalline, 
elastomeric, tough and bony materials, e.g. proteins similar to, but different from,
silk, elastin, collagen, keratin or other naturally occurring structural polymers
having a repetitive amino acid sequence motif. The gene encoding the repeating-
unit-containing peptides is produced to particularly avoid problems previously 
associated with genes containing multiple repeating units.

Genes produced according to the methods described herein will generally be
at least 900 nt in length, usually at least 1200 nt in length, preferably at least 1500 nt
in length, usually not more than 20 knt in length, more usually not more than 12 knt
in length, frequently not more than about 6 knt in length. This will usually result in
a protein of from about 30 kDal, usually at least 35 kDal, and not more than about
250 kDal, more usually not more than about 125 kDal. The methods for production 
of the synthetic genes encoding the protein polymer involve preparation of a dsDNA 
"monomer", which is an extended segment of DNA principally encoding amino acid 
repeating units, where the dsDNA monomer is generally a repeating segment of the
final product, where the final product will have from 2, frequently at least 3, and up
to 50, usually not more than about 30, more usually not more than about 20, 
monomeric units. There is one exception, to be described below, where the 
monomer may be the entire final repeating unit gene. The monomer will be a 
dsDNA whose sequence is, with one exception, established prior to its
multimerization to provide the gene.
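
As a rough check on the size ranges given above, gene length can be converted to an approximate protein mass. The following is a minimal sketch and not part of the described methods; the average residue mass of 110 Da is an assumed general approximation (repeat-rich proteins high in glycine and alanine will be lighter), and the function name is hypothetical.

```python
# Rough conversion from gene length to protein mass.
# ASSUMPTION: an average residue mass of ~110 Da; this value is not from the text.
AVG_RESIDUE_MASS_DA = 110.0


def protein_mass_kda(gene_length_nt, avg_residue_mass_da=AVG_RESIDUE_MASS_DA):
    """Approximate mass (kDa) of the protein encoded by an open reading frame."""
    if gene_length_nt <= 0 or gene_length_nt % 3 != 0:
        raise ValueError("gene length must be a positive multiple of 3")
    residues = gene_length_nt // 3  # one codon (3 nt) per amino acid
    return residues * avg_residue_mass_da / 1000.0


if __name__ == "__main__":
    # Gene lengths taken from the ranges stated in the text.
    for nt in (900, 1200, 1500, 6000):
        print(f"{nt} nt -> ~{protein_mass_kda(nt):.0f} kDa")
```

Under this assumption a 900 nt gene corresponds to roughly 33 kDa, which is consistent with the stated lower bound of about 30 kDal.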

The size of the dsDNA monomer is dependent upon the desired amino acid 
monomer sequence as well as the way in which the monomer is obtained. If the 
monomer is constructed using any newly synthesized and ligated DNA, then the 
monomer is always sequenced prior to multimerization and the practical limitations
of DNA sequencing technology limit the monomer size to about 500 nt, usually
about 400 nt. If the gene monomer is constructed solely from digestion fragments 
of previously constructed and sequenced monomers, then the final gene monomer is 
typically characterized by restriction digests. Therefore, the gene monomer can be 
as large as the final gene, depending upon the desired amino acid repeating unit
sequences and periodicity.

Because of the nature of the subject genes, which involve long tracts of 
repetitive units encoding the same amino acid sequence, the manner in which the 
monomer gene may be successfully prepared is restricted and requires that there be 
certainty as to the fidelity of the sequence. In the initial methods of preparation,
there were a number of stages: preparation of ssDNA pairs, which overlapped and
when hybridized provided segments; cloning of segments; sequencing of segments to 
ensure the fidelity of the sequence; and combining the segments to form the
monomer. The order and manner in which these various operations were carried out
could be varied, depending on the nature of the gene and the protein it encoded. In 
order to reduce the possibility of undesirable recombination events, different nucleic 
acid sequences are used to encode the same repetitive unit. 
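
The idea of encoding the same repetitive unit with different nucleic acid sequences can be illustrated with a short sketch. This is an illustration under stated assumptions, not the procedure used in the examples: the repeat unit "GAGAGS" is a hypothetical silk-like unit chosen for the demonstration, only three amino acids of the standard genetic code are tabulated, and rotating through synonymous codons is just one simple way to vary the DNA.

```python
from itertools import cycle

# Synonymous codons from the standard genetic code (subset: Gly, Ala, Ser).
SYNONYMOUS_CODONS = {
    "G": ["GGT", "GGC", "GGA", "GGG"],
    "A": ["GCT", "GCC", "GCA", "GCG"],
    "S": ["TCT", "TCC", "TCA", "TCG", "AGT", "AGC"],
}
CODON_TO_AA = {
    codon: aa for aa, codons in SYNONYMOUS_CODONS.items() for codon in codons
}


def encode_with_varied_codons(peptide):
    """Encode a peptide, rotating through synonymous codons for each residue."""
    pickers = {aa: cycle(codons) for aa, codons in SYNONYMOUS_CODONS.items()}
    return "".join(next(pickers[aa]) for aa in peptide)


def translate(dna):
    """Translate DNA that uses only the codons tabulated above."""
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    repeat_unit = "GAGAGS"  # HYPOTHETICAL repeat unit, for illustration only
    dna = encode_with_varied_codons(repeat_unit * 2)
    first, second = dna[:18], dna[18:]
    print(first, second)
    print("same peptide:", translate(first) == translate(second))
    print("same DNA:", first == second)
```

The two copies translate to the same amino acid sequence while differing at the nucleotide level, which is the property relied on to reduce undesirable recombination.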

There are three ways to obtain the monomer. The first way relies on
synthesis and assembly of single stranded deoxynucleotide oligomers encoding from
about 1 to 12, more usually 1 to 9, frequently 1 to 6 repeating amino acid units into 
a dsDNA monomer sequence. Each repeat unit will have about 3 to 30 codons (9 to 
90 bases), usually about 3 to 25 codons, more usually about 3 to 15 codons,
frequently not more than 9 codons, particularly when mimicking a naturally
occurring motif. The number of amino acid repeat units in a dsDNA monomer 
sequence will depend to a substantial degree on the size of the repeating unit. 
Conveniently, ssDNA oligomers may be prepared having from about 15 to 120 
bases, usually about 21 to 90 bases, more usually about 39 to 72 bases, although
oligomers may be prepared with up to 300 bases, more usually up to about 252
bases. For repeating units having a few amino acids, usually in the range of 3 to 15
amino acids, more usually in the range of 3 to 9 amino acids, the single stranded 
oligomer will conveniently have from about 2 to 12 amino acid repeating units. 

The number of different single stranded oligomers will usually be at least 2,
forming 1 pair, more usually about 6, forming 3 pairs, or may be 8 or more,
forming 4 or more pairs, where the protein polymer has the same repeating unit. 
Where block copolymers are prepared, the number of oligomers will depend on the 
number of different blocks and the size of the blocks. The dsDNA segments formed 
by the pairs of oligomers of the different ssDNA oligomers may encode the same
amino acid sequence or a different amino acid sequence, but where more than one
dsDNA segment is synthesized, at least two segments will have different nucleotide 
sequences. Each pair of oligomers forming a dsDNA segment are complementary 
and at least partially overlap, providing blunt or cohesive (protruding) ends, 
preferably protruding ends, to allow for ease of assembly and ligation of the dsDNA
to form a "monomer".

The dsDNA segments are desirably assembled in a prokaryotic vector by 
linearizing a vector having an origin and convenient restriction sites, which may
involve a polylinker, for insertion of one or more dsDNA segments. The vector will
also have a marker gene for selection, which will usually impart antibiotic 
resistance, but may afford another distinguishing characteristic, e.g. chromophore or 
fluorophore formation. The marker will preferably provide antibiotic resistance,
there being a wide variety of antibiotic reagents, e.g. tetracycline, chloramphenicol,
actinomycin, neomycin, ampicillin, hygromycin, heavy metals, etc. Other markers
include β-galactosidase, which, with the substrate X-gal, provides a blue color.
Numerous vectors are commercially available for cloning in E. coli and need not be 
exemplified here. The vector is then introduced into an appropriate cloning host by
any convenient means, including calcium phosphate precipitated DNA, fusion,
transfection, conjugation or the like. The cells are then grown in an appropriate 
selective nutrient medium. Surviving cells are harvested, lysed and the plasmid 
isolated. 

By having a multiplicity of dsDNA segments, the termini may be designed so
that the first segment has a 3' or 5' terminus complementary to the 5' or 3' terminus
of a second segment and so on, where the termini may have different consensus 
sequences for different restriction enzymes or not be recognized by any known 
restriction enzyme. The termini of the dsDNA segments may be selected to have 
protruding 5' ends, protruding 3' ends, or a protruding 5' and a protruding 3' end 
on the same strand, either the coding strand or the non-coding strand.
Complementation of the protruding ends may destroy the sequence of the restriction
site or retain the sequence, when different dsDNA segments are ligated. 

Restriction enzymes are used to digest the dsDNA of the cloning vector and 
insert dsDNA segments. Restriction enzyme digestion of the vector, whether or not 
already having inserted dsDNA segments, will provide termini which are
complementary to the termini of the next dsDNA segment being inserted, which
may already be a combination of two or more synthesized dsDNA segments. In 
selecting dsDNA segment sequences, one generally selects the terminal sequences to 
allow for linearization of the vector and insertion of the next dsDNA segment, 
proximal to a terminus of a segment. However, sometimes it is convenient in
creating a monomer to insert a dsDNA segment within a previously cloned segment.
The final dsDNA segment sequence which is inserted into the cloning vector may
not be the entire dsDNA segment which was synthesized, but will have the
appropriate complementary termini that allow for insertion formed by the restriction 
enzyme digestion. Similarly, digestion of the vector may create the appropriate 
termini for insertion of the next dsDNA segment by restriction enzyme digestion that
deletes a portion of the originally synthesized and cloned DNA. Generally, in
building a monomer it is preferable to cleave the vector with a single restriction 
enzyme corresponding to a single restriction enzyme recognition site so that the use 
of partial enzyme digestions can be avoided. 

By having different termini at each end of each dsDNA segment, the
individual segments cannot oligomerize, even if they have been phosphorylated. In
this way, when the different segments are combined, the ends of the combination of 
the segments may have complementary termini, so that they can be oligomerized. 
The 3' and 5' termini of each dsDNA segment are generally selected so that only
one copy of the segment can be cloned in one orientation into a cloning vector.
However, sometimes it is convenient to have complementary 3' and 5' termini, so
that a segment can be cloned into an existing monomer or portion of a monomer 
even though it is then necessary to select a clone having the segment inserted in the 
correct orientation and number. During the construction of the monomer, some 
combinations of dsDNA segments may not be in reading frame, one to the other.
However, by appropriate selection of the dsDNA segment sequences and the
restriction enzymes used in the monomer construction, the final combination of 
dsDNA segments comprising the monomer will be in a continuous open reading 
frame coding for the desired amino acid sequence. The above-described approach 
may be used with the other methods of forming the monomer, as will be described
hereafter.
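
The requirement that the assembled monomer form a continuous open reading frame, even when individual segments are out of frame with one another, can be checked mechanically. The sketch below is illustrative only; the two segment sequences are hypothetical, and the check covers only frame length and in-frame stop codons of the standard genetic code.

```python
# Stop codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def in_open_reading_frame(dna):
    """True if dna is a whole number of codons with no in-frame stop codon."""
    dna = dna.upper()
    if len(dna) == 0 or len(dna) % 3 != 0:
        return False
    return all(dna[i:i + 3] not in STOP_CODONS for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # HYPOTHETICAL segments: neither is a whole number of codons by itself.
    segment_a = "GGTGC"
    segment_b = "TGGC"
    print("segment_a alone:", in_open_reading_frame(segment_a))
    print("segment_b alone:", in_open_reading_frame(segment_b))
    print("a + b combined: ", in_open_reading_frame(segment_a + segment_b))
```

A check of this kind complements, but does not replace, sequencing of the cloned monomer.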

Using these techniques and design strategies, it is possible to construct the 
dsDNA monomer in a variety of ways, as described in the examples that follow. In 
one variation, a first dsDNA segment is cloned into the cloning vector after it has 
been linearized by restriction enzyme digestion. After cloning, the first dsDNA
segment is characterized, such as by restriction analysis and sequencing. Where the
dsDNA segment is relatively small, sequencing can be performed rapidly and 
substantially error free.

Once the first dsDNA segment has been shown to have the correct sequence,
the vector may then be used in the next stage in the preparation of the gene. The 
vector is linearized at the 5' or 3' terminus of the first dsDNA segment cloned. By 
employing a polylinker in the vector at the 5' and/or 3' terminus of the dsDNA
segment cloned, the vector may be digested by using a restriction enzyme which
cleaves in the polylinker to provide a terminus at the 5' or 3' terminus of the vector 
complementary to the 3' or 5' terminus of the next dsDNA segment. Alternatively, 
one may use restriction enzymes which cleave distal from the consensus sequence. 
In this way the vector may be repeatedly cleaved and ligated, without cleavage of
the monomer DNA being constructed. After cloning, the combined dsDNA
segments may be characterized as described above. The process may be repeated
until all of the dsDNA segments have been inserted and verified for sequence and 
being in the proper order and reading frame. 

In another variation, two or more dsDNA segments may be sequentially
cloned as described above, with each new insertion at the 3' or 5' terminus of the
previously cloned segment, and then another dsDNA segment may be inserted 
between the previously cloned segments. In another variation, a first dsDNA 
segment may be cloned as described above and then another dsDNA segment may be 
inserted internal to the cloned segment. In another variation, two or more dsDNA
segments may be cloned simultaneously into a vector, with additional dsDNA
segments sequentially inserted a) at the 3' or 5' termini of the previously cloned 
DNA segments, b) between the previously cloned DNA segments, or c) internal to a 
previously cloned segment. In another variation, all dsDNA segments comprising 
the monomer may be simultaneously ligated into the cloning vector. In another
variation, each dsDNA segment comprising the monomer may be individually
cloned and characterized. The individual dsDNA segments are then purified and
ligated in a single cloning step to construct the monomer, which is sequenced. 

An essential element in this method for constructing the monomer is that the 
pairs of ssDNA be annealed into dsDNA segments prior to further manipulation.
Generally, each dsDNA or combination of segments, once cloned, is sequenced
prior to further manipulation. The monomer is always sequenced prior to 
multimerization.

A second approach depends on the synthesis of a single strand of all or a
portion of the monomer. Synthetic techniques allow reasonably accurate 
oligonucleotide synthesis of 300 bases or more. For the most part the single strand 
will be in the range of about 100 to 300 bases, usually in the range of about 100 to
250 bases. The single strand is then used to produce a complementary strand,
conveniently using the polymerase chain reaction ("PCR"). The synthesized PCR 
template comprises the appropriate sequence of the ssDNA monomer flanked by 
appropriate primer binding site sequences. 

The primers which are used for PCR are designed not to hybridize readily to the
repeat sequences, having overall a nucleotide sequence substantially different from the
nucleotide sequence of the sequence being amplified, although a portion of the primer 
desirably includes a sequence common with the non-repetitive portion of the sequence 
being amplified; this will be not more than about 30 nucleotides, usually not more than
about 25 nucleotides and at least 10 nucleotides, usually at least 12 nucleotides. The
total number of nucleotides in the primers will generally be in the range of about 15 to
50, more usually 20 to 45. In the portion of the primer hybridizing with the target, 
none or a portion of a restriction enzyme consensus sequence is included in the primer 
which binds to the target ssDNA; where only a portion is present, the target will
complete the consensus sequence. Restriction enzyme digestion allows for cleavage at
the terminus or proximal to the terminus of the primer sequence hybridizing to the
target ssDNA. For the most part, where a portion of the restriction enzyme consensus
sequence is included in the primer, restriction enzyme digestion will maintain the 
monomer amino acid motif. Preferably, not more than about 5 nt, usually not more 
than about 3 nt, preferably not more than 2 nt, will be left from the primer after cleavage
by restriction enzyme digestion. Usually, the consensus sequence will be 6 to 8, usually
6 nucleotides for the restriction enzyme. The 3' and 5' primers will be different, so
as not to hybridize with each other. In order for the primers to work efficiently, the
primers must have similar Tm and ΔG characteristics for hybridizing with their
respective complementary DNA sequences. These characteristics can be controlled by
the length of the primers, the length of the sequence hybridizing with the target ssDNA,
and the proportion of GC binding that occurs. Generally the primers will have at least 
40%, more usually at least 45% G and C, and may be 50% or more, usually not
greater than about 75%, more usually not greater than about 65%. For hybridization
between primers and their respective complementary binding sites, the Tm will generally
be in the range of about 84 to 90°C, usually about 85 to 89°C, more usually about 87°C
± 1°C. The Tm difference between the pair of primers will be less than about 2°C,
usually about 1.5°C, more usually less than about 0.75°C. Such Tm values are determined by
calculation with the software program OLIGO® PRIMER ANALYSIS SOFTWARE
version 4.0 for Macintosh computers, assuming a salt concentration of 1000 mM and
an oligonucleotide concentration of 0.6 picomoles. The Tm for hybridizing of the
primer to the target ssDNA will generally be in the range of about 68 to 73°C, usually
about 71°C ± 1°C, having similar restrictions as regards the whole primer.
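
The length and G+C constraints on the primers lend themselves to a simple screening step. The following sketch is a hedged illustration: the thresholds are the outer ranges stated above, the function names and example sequences are hypothetical, and Tm is taken as an externally computed input because the calculation performed by the cited software is not reproduced here.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in a primer sequence."""
    p = primer.upper()
    return (p.count("G") + p.count("C")) / len(p)


def primer_in_range(primer, min_len=15, max_len=50, min_gc=0.40, max_gc=0.75):
    """Check primer length (nt) and G+C fraction against the stated outer ranges."""
    return (min_len <= len(primer) <= max_len
            and min_gc <= gc_fraction(primer) <= max_gc)


def tm_matched(tm_a_c, tm_b_c, max_diff_c=2.0):
    """True if two externally computed Tm values (deg C) differ by less than max_diff_c."""
    return abs(tm_a_c - tm_b_c) < max_diff_c


if __name__ == "__main__":
    # HYPOTHETICAL primer sequences, for illustration only.
    for primer in ("GCGCATATGCGCATATGCGC", "ATATATATATATATAT"):
        print(primer, round(gc_fraction(primer), 2), primer_in_range(primer))
    print("Tm pair matched:", tm_matched(87.0, 88.0))
```

Tighter preferred ranges (for example, not greater than about 65% G+C, or a Tm difference under 0.75°C) can be applied by passing different thresholds.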

In addition, because the primers will be selected so that not more than a small 
portion of the primer will participate in the final gene sequence, a restriction site will be 
present to permit removal of the primer sequence, leaving the resulting sequence with 
ends which will allow for multimerization, particularly cohesive ends, where the ends
may recreate the restriction site or the restriction site may be destroyed. Thus, the
cleavage site may occur within the consensus sequence or away from the consensus 
sequence, while providing for complementary termini. 

The resulting dsDNA segment(s) are then digested by restriction enzyme 
digestion to remove the primer binding sites, cloned in a cloning vector, and if more
than one segment has been synthesized, assembled in a cloning vector as described
above to form the monomer. The resulting dsDNA monomer is cloned, purified and 
sequenced to ensure that it has the correct sequence. The monomer prepared this 
way will have the same limitations as to size and the number of amino acid repeating 
units which are encoded as the monomer prepared by the sequential and/or
simultaneous cloning of dsDNA segments as described above.

After the monomer has been prepared, characterized and the desired 
sequence confirmed, the monomer may then be excised from the vector and purified 
in accordance with conventional procedures. At this time the "monomer" synthesis
has been completed. The monomer may then be used to produce the gene. As is
evident from the above descriptions, dsDNA segments prepared as in (1) and/or (2)
above can be combined to form a monomer, which is sequenced.

The third approach relies on the use of fully characterized dsDNA which is
already present in a monomer, previously prepared by either of the methods 
described above. Using this approach allows for great flexibility in constructing 
new monomers, particularly where copolymers comprising different amino acid
repeating units are desired. Using the appropriate restriction enzymes, all or part of
the dsDNA comprising a monomer may be purified. In some instances, all or a 
portion of a previously synthesized monomer is then used as a new monomer in 
combination with newly synthesized dsDNA segments prepared as in (1) and/or (2) 
above, or a portion of a previously synthesized monomer is then used as a new
monomer by itself. In other instances, the desired dsDNA from two or more
separate monomers may be combined to construct a new monomer encoding the 
amino acid repeating units of interest, either by themselves or in combination with 
newly synthesized dsDNA segments prepared as in (1) and/or (2) above. The 
digested monomer DNA fragments which are to be combined may have
complementary or non-complementary ends. If the termini of the monomer
sequences are not complementary, as required, the termini may be made so by 
employing adapters, filling in, nuclease digestion, or the like. Once the appropriate 
monomer sequences have been cloned together, sequentially and/or simultaneously, 
to make the new monomer, the monomer is then characterized and sequenced, if
necessary. If newly synthesized adapters or filling in reactions or nuclease digestion
or the like are employed, the region comprising the modified monomer DNA is 
sequenced. 

When the gene encoding the desired protein product is a homooligomer of 
the monomer, desirably the termini have cohesive ends and may retain the same
restriction site consensus sequence or result in a sequence other than the consensus
sequence. By appropriate choice of the restriction enzymes or polylinker, the
termini of the monomer may have the same or different terminal restriction sites, but 
will have complementary ends, if the monomer is to be multimerized. Preferably, a 
single restriction enzyme that cleaves the monomer from the vector at asymmetric
consensus sites will be employed. However, restriction enzymes that cleave outside
of the recognition sites may also be used. By having a monomer with different, yet
complementary termini, the monomer may be ligated in vitro with the monomers
only assembling in one orientation. 
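
Whether a pair of cohesive termini will force head-to-tail assembly can be reasoned about with a small sketch. It is an illustration under stated assumptions: each overhang is written 5' to 3' as the single-stranded protrusion at that end, both overhangs have the same polarity, and the example overhang sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def can_anneal(overhang_a, overhang_b):
    """True if two same-polarity overhangs (each written 5'->3') can base-pair."""
    return overhang_a.upper() == reverse_complement(overhang_b)


def is_self_complementary(overhang):
    """True if the overhang can pair with another copy of itself."""
    return can_anneal(overhang, overhang)


def assembles_head_to_tail_only(left_overhang, right_overhang):
    """True if the two ends pair with each other but neither pairs with itself."""
    return (can_anneal(left_overhang, right_overhang)
            and not is_self_complementary(left_overhang)
            and not is_self_complementary(right_overhang))


if __name__ == "__main__":
    # HYPOTHETICAL overhangs: a non-palindromic pair and a palindromic one.
    print(assembles_head_to_tail_only("ACGG", "CCGT"))
    print(assembles_head_to_tail_only("GATC", "GATC"))
```

A palindromic overhang at both ends still allows ligation, but copies can then join in either orientation, which is why different yet complementary termini are preferred for multimerization.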

As evidenced by the above description, the monomer is a molecule which 
will normally be comprised of a plurality of previously prepared dsDNA segments,
normally being formed from at least two different dsDNA segments, which may or
may not encode the same amino acid sequence, but generally providing for blocks of 
the same pattern of repeat amino acid units throughout the final polymer gene (the 
exception is when the monomer is the gene). Thus the monomer may provide for a 
homopolymer, copolymer, or polymer having a defined motif, where the amino acid
repeating units vary, e.g. collagen.

The monomer is then multimerized by ligation, conveniently employing from 
about 0.01 to 100 μg of the monomer under ligating conditions, where multimers
having different numbers of monomers are obtained. The multimers may then be
segregated by size, selecting multimers of a predetermined size. Any of the original
mixture, the partially purified mixture, or size segregated fractions thereof, may
then be introduced into a vector. Either an adapter vector or an appropriate 
expression vector is employed. The adapter vector has a polylinker which will allow 
for insertion into the polylinker, so as to be capable of being read in any reading 
frame. In this way one may introduce different unique restriction sites which allow 

20 for excision and transfer of the multimer gene from the expression vector. The 
muliimer gene may be characterized and purified before transfer to the expression 
vector. The multimer will have appropriate termini which will allow for insertion 
into the vector and, as appropriate, have end groups which are present in the vector 
or be inserted with termini which will allow for the exact excision of the gene. One 

25 may select a particular sized multimer or a plurality of multimers of different size 
for expression, so that one has a family of protein polymers, sharing the same 
repeating motif. 
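
The size segregation step may be pictured with the following sketch, which lists the expected lengths of a ligation ladder and picks out the multimers that fall in a chosen size window. It assumes that each ligation joins whole monomers without gain or loss of base pairs; the function names and the 180 bp monomer are hypothetical.

```python
# Illustrative sketch only; the monomer length and size window are hypothetical.
def multimer_sizes(monomer_bp, max_copies):
    """Map copy number -> expected multimer length in bp."""
    return {n: n * monomer_bp for n in range(1, max_copies + 1)}


def select_by_size(sizes, min_bp, max_bp):
    """Return the copy numbers whose length lies within [min_bp, max_bp]."""
    return [n for n, bp in sizes.items() if min_bp <= bp <= max_bp]


if __name__ == "__main__":
    ladder = multimer_sizes(monomer_bp=180, max_copies=10)
    print(select_by_size(ladder, min_bp=900, max_bp=1500))
```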

The expression vector will be characterized by having an origin of replication 
which is functional in an appropriate expression host, usually for episomal
maintenance, and a marker for selection. Markers as described above may find use.
For unintegrated vectors or constructs, the origin of replication will usually provide 
for multicopies, usually greater than about 5 copies on the average. 

The expression vector will also have a promoter which is functional in the
expression host. Various promoters can find use, which provide for a high level of
transcription, either inducible or constitutive transcription. Illustrative promoters
include β-lactamase, β-galactosidase, λP_L or λP_R promoters, trpE promoter, trp-lac
promoter, T7 promoter (particularly genes 9 and 10), cP, etc. The multimer gene
and the linearized vector may be combined under hybridizing, usually including
ligating, conditions. Where the multimer gene does not have an initiation codon,
such a codon can be added. More conveniently, the multimer gene may be inserted
into a coding sequence present in the vector, under the transcriptional control of a
promoter. The coding sequence in the vector will generally not exceed 200 bp,
usually not exceeding about 60 bp, where the site into which the multimer gene is
inserted has the coding sequence and multimer gene in proper reading frame.
Generally, the coding sequence present in the vector will be not more than 20%,
usually fewer than 10%, preferably fewer than about 5% of the total number of
bases in the coding sequence.
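
The proportions stated above may be checked with a short calculation. The sketch below assumes that the total coding sequence is simply the vector-derived coding bases plus the multimer gene, and that the multimer gene itself begins on a codon boundary; the function names and the example lengths are hypothetical.

```python
# Illustrative sketch only; lengths are hypothetical.
def vector_coding_fraction(vector_coding_bp, multimer_bp):
    """Fraction of the fused coding sequence contributed by the vector."""
    return vector_coding_bp / (vector_coding_bp + multimer_bp)


def insertion_is_in_frame(vector_bp_before_insert):
    """True if the insertion site follows a whole number of codons."""
    return vector_bp_before_insert % 3 == 0


if __name__ == "__main__":
    fraction = vector_coding_fraction(vector_coding_bp=60, multimer_bp=1140)
    print(f"{fraction:.1%}", insertion_is_in_frame(60))
```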

A signal sequence may be present at the 5' terminus of the coding sequence 
to allow for secretion of the protein polymer into the periplasmic space. For the 
most part, the product will be produced intracellularly. 

Instead of a vector, DNA constructs may be employed for transformation of
the expression host, with integration of the construct into the genome of the
expression host. The construct will differ from the vector primarily by lacking an
origin which provides for episomal maintenance. Thus, the construct will provide at
least transcriptional and translational initiation and termination regions, the gene
encoding the protein polymer between the initiation and termination regions and
under their regulatory control, a marker for selection as described above, and other
functional sequences, such as homologous sequences for integration into the host
genome, sequences for priming for the polymerase chain reaction, restriction sites,
and the like.

For the most part, the expression host will normally be unicellular,
prokaryotic or eukaryotic, but may be from a multicellular organism. The organism
may be selected from bacteria, algae, fungi, insect cells, plant cells, etc. Illustrative
hosts include E. coli, B. subtilis, B. stearothermophilus, S. cerevisiae, and the like.

The expression host is then grown in accordance with conventional ways in
an appropriate medium in culture, e.g. fermentation. After the cells have been
grown to an appropriate density, the cells may be harvested, lysed and the product
isolated by appropriate means, in accordance with the physical and chemical
characteristics of the product. In some instances, the product is insoluble at
moderate temperatures in an aqueous medium, and may be purified by detergent
extraction at mildly elevated temperatures, above about 60°C, see U.S. Patent No.
5,235,041. As appropriate, the crude or purified product may then be used for its
intended purpose.

The genes of the subject invention generally comprise concatenated
monomers of DNA encoding the same amino acid sequence, where all or a part of
two or more different monomers encoding different amino acid repeating units may
be joined together to form a new monomer encoding a block copolymer. The
individual amino acid units will have from 3 to 30 amino acids (9 to 90 nt), usually
3 to 25 amino acids (9 to 75 nt), more usually 3 to 15 (9 to 45 nt) amino acids,
usually having the same amino acid appear at least twice in the same unit, generally
separated by at least one amino acid. In some instances, the minimum number of
amino acids will be 4. Within a monomer, dsDNA encoding the same amino acid
repeating unit may involve two or more nucleotide sequences, relying on the codon
redundancy to achieve the same amino acid sequence.
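
The use of codon redundancy may be illustrated as follows. The two 18 nt sequences in this sketch are hypothetical and were chosen only so that both translate to the repeating unit SGAGAG; the codon table is restricted to the standard codons for glycine, alanine and serine.

```python
# Illustrative sketch only; UNIT_1 and UNIT_2 are hypothetical sequences.
CODON_TABLE = {
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",          # glycine
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",          # alanine
    "TCT": "S", "TCC": "S", "TCA": "S", "TCG": "S",          # serine
    "AGT": "S", "AGC": "S",
}

UNIT_1 = "TCTGGTGCAGGCGCTGGA"
UNIT_2 = "AGCGGCGCTGGAGCCGGT"


def translate(dna):
    """Translate a DNA string whose length is a multiple of 3."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def mismatches(seq_a, seq_b):
    """Count positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length")
    return sum(a != b for a, b in zip(seq_a, seq_b))


if __name__ == "__main__":
    print(translate(UNIT_1), translate(UNIT_2), mismatches(UNIT_1, UNIT_2))
```

Both units encode the same six amino acids while differing at several nucleotide positions, which is the property relied upon to reduce exact repetition at the DNA level.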

For the most part the DNA compositions of this invention may be depicted 
by the following formula: 

$K_k\,(W\,M_r\,X_x\,N_s\,Y_y)_i\,L_l$

wherein: 

K is a DNA sequence encoding an amino acid sequence of from about 1 to
125 amino acids, usually 1 to 60 amino acids, which may be any sequence
depending upon the manner of preparation of the construct and the purpose of the
protein product, generally being fewer than about 20% of the total number of amino
acids, more generally being fewer than about 10% of the total number of amino
acids, which may be any sequence, particularly a naturally occurring sequence
where the multimer structural gene has been fused to another DNA sequence in
reading frame. K, if present, will have the initiation methionine codon. L may be
the same or different from K, coming within the definition of K, but lacking the
initiation methionine codon;

k and l are the same or different and are 0 or 1;

W has the formula:

$[(A)_n\,(B)_p]_q$
wherein: 

A is a DNA sequence coding each time that it appears for the same amino
acid repeating unit normally having at least one amino acid appear at least twice in
the sequence, where A will generally be from about 9 to 90 nucleotides (nt), more
usually from about 9 or 12 to 75 nt, preferably from about 9 or 12 to 45 nt, more
preferably from about 9 or 12 to 30 nt, and in some instances may be as few as 24
nt;

where there will usually be at least two different A's, usually not more than
ten different A's, more usually not more than six different A's, which code for the
same amino acid sequence but differ from each other by at least one nucleotide and
may differ by as many as ten nucleotides, usually not differing by more than about
five nucleotides from another A sequence, each of the different A's usually being
repeated at least twice; at least two different codons are employed for the same
amino acid, e.g. GGC and GGA for glycine, in different A's coding for the same
amino acid sequence unit;

n will be an integer of at least 2, usually at least about 4, more usually at
least about 8, and not more than about 250, usually not more than about 200,
frequently not more than about 125, and in some instances may not exceed about 50;

B is a DNA sequence different from A coding for an amino acid sequence
other than the amino acid sequence unit coded by the A unit and serves as a linking
unit between oligomers of A units. B will generally have from about 3 to 45 nt (1
to 15 amino acids), more usually from about 3 to 30 nt (1 to 10 amino acids);

where the B units appearing in the gene may be the same or different, there
usually not being more than about 10 different B units, more usually not more than
about 5 different B units, where the B units may differ from about 1 to 45 nt, more
usually from about 1 to 15 nt, where the different B's may code for the same or
different amino acid sequence;

p is 0 or 1 and may differ each time there is a successive A unit;

q is an integer of at least 1 and will vary with the number of nucleotides in A
and B, as well as the values of n and p. The variable q will be selected so as to
provide for at least 90 nt for the multimeric portion of the structural gene, preferably
at least about 150 nt, more preferably at least 450 nt, and most preferably at least
900 nt, and the number of nucleotides will usually not exceed about 10,000 nt, more
usually not exceeding about 8,000 nt, generally being in the range of about 900 to
6,000 nt, more usually to about 5,000 nt; and
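
The dependence of q on the other variables may be worked through numerically. The sketch below makes the simplifying assumptions that every A unit has the same length and that p takes the same value in every block, although p is permitted to differ; the function names and the example values are hypothetical.

```python
# Illustrative sketch only; assumes uniform A length and constant p.
import math


def multimer_length_nt(len_a, n, len_b, p, q):
    """Length in nt of [(A)n (B)p]q for uniform A and B lengths."""
    return q * (n * len_a + p * len_b)


def min_q_for_length(len_a, n, len_b, p, target_nt):
    """Smallest q (at least 1) giving a multimeric portion of >= target_nt."""
    block_nt = n * len_a + p * len_b
    return max(1, math.ceil(target_nt / block_nt))


if __name__ == "__main__":
    q = min_q_for_length(len_a=18, n=8, len_b=9, p=1, target_nt=900)
    print(q, multimer_length_nt(18, 8, 9, 1, q))
```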

M is a DNA nucleotide sequence of about 12 to 150 nt, usually being 18 to
150 nt, more usually not more than about 90 nt, which may encode any amino acid
sequence, usually encoding a functional sequence which provides for a natural or
synthetic sequence resulting in a biological or chemical function or activity;

r and s are the same or different, being 0 to 3, usually 0 to 2, depending on
whether a functional group is present in the polymer, usually being 1 to 2, where
different, the same or similar functional groups may be combined in a contiguous
manner;

N is the same or different from M and comes within the same definition as
M;

X may be the same as or different from W, usually different, and will have
the formula:

$[(A^1)_{n^1}\,(B^1)_{p^1}]_{q^1}$

wherein:

$A^1$, $B^1$, $n^1$, $p^1$ and $q^1$ are the same as or different from A, B, n, p and q
respectively, at least one being different, wherein the analogous symbols come
within the same definition as their counterparts;

x is 0 or 1;

Y may be the same as or different from W, usually different, and will have
the formula:

$[(A^2)_{n^2}\,(B^2)_{p^2}]_{q^2}$

wherein:

$A^2$, $B^2$, $n^2$, $p^2$ and $q^2$ are the same as or different from A, B, n, p and q
respectively, at least one being different, wherein the analogous symbols come
within the same definitions as their counterparts;

y is 0 or 1;

i is 1 to 100, usually 1 to 50, more usually 1 to 30, particularly 1, when x,
y, r and s are 0;

when x or y are 1, $q$, $q^1$ and $q^2$ will be a total of at least 2, usually at least 5
and not more than about 50, usually not more than about 33.

The total number of nucleotides will be at least 900 nucleotides, usually at
least about 1200 nt, preferably at least about 1500 nt and may be 20 knt
(kilonucleotides), usually not more than about 6 knt, more usually not more than
about 4 knt.

The polypeptide encoded by the above DNA sequence will have the
following formula:

$K'_k\,(W'\,M'_r\,X'_x\,N'_s\,Y'_y)_i\,L'_l$

wherein:

W' will have the following formula

$[(D)_n\,(E)_p]_q$

wherein:

D is the amino acid sequence encoded for by A and therefore has the
numerical limitations based on 3 nucleotides defining a codon that codes for one
amino acid;

E is the amino acid sequence encoded for by B, and therefore has the
numerical limitations based on 3 nucleotides defining a codon, where each E may be
the same or different, depending upon the coding of B;

and, wherein, likewise K', W', M', X', N', Y' and L' are the amino acid
sequences encoded for by K, W, M, X, N, Y and L respectively. However, in the
case of K and L, subsequent processing, such as protease treatment, cyanogen
bromide treatment, etc. may result in partial or complete removal of the N- or
C-terminal non-multimeric chains.

n, p, q, k, r, s, x, i and l have the same definitions as previously indicated.
Particular polymeric compositions having amino acid repeating units having
the same compositions (A) will have the following formula where x and y are 0,

$K'_k\,[(D)_n\,(E)_p]_q\,L'_l$

where all of the symbols have been defined previously; and
the DNA sequence will have the formula

$K_k\,[(A)_n\,(B)_p]_q\,L_l$

where all of the symbols have been defined previously.
Particular DNA sequences encoding copolymeric compositions having a
repeating unit of two to three multimeric blocks will have the following formula:

$K_k\,(W''\,M''_r\,X''_x\,N''_s\,Y''_y)_{i''}\,L_l$

wherein:

W'' is a multimer having the formula

$[(A^3)_{n^3}\,(B^3)_{p^3}]_{q^3}$,

where $A^3$ is of 3 to 15, usually 4 to 6 codons, otherwise coming within the
definition of A;

$n^3$ will be from about 2 to 40, usually 2 to 32;

$B^3$ is of from 2 to 20, usually 4 to 6 codons;

$p^3$ is 0 or 1;

$q^3$ is of from about 2 to 50, usually 2 to 30, depending on the value of $n^3$, as
discussed previously for n and q;

X'' and Y'' are the same as or different from W'', usually different, coming
within the same definitions as W'';

M'' and N'' come within the definitions of M' and N';

i'' is at least 2, usually at least 5 and not more than about 75, usually not
more than about 50, generally not exceeding 30;

with the other symbols as defined previously, wherein at least one of x and y
is 1.

The compositions of the invention will usually have a molecular weight of at
least about 30 kDal, usually 50 kDal, frequently at least about 60 kDal and may
have molecular weights as high or higher than 500 kDal, usually not exceeding 300
kDal, more usually not exceeding about 250 kDal, and in many instances not
exceeding 125 kDal, the higher ranges generally being the multimer combinations,
with the individual multimer usually being less than about 150 kDal, usually less
than about 100 kDal.

The nucleotide sequences which are employed will be synthesized, so that the
repetitive units will have different codons for the same amino acid as described
above. Usually, at least about 25%, more usually at least about 40%, and generally
at least about 60%, but not greater than about 95%, preferably not greater than
about 90% of the nucleotide sequences encoding the repetitive units will be the
same. Greater diversity within those ranges will be employed where the initial
constructs are experimentally shown to undergo spontaneous recombination events.

Of particular interest are polypeptides which have as a repeating unit
SGAGAG (G = glycine; A = alanine; S = serine), where the choice of the S as the
initial amino acid of the repeating unit is arbitrary, since except for the first and last
units, all the other units will be the same. This repeating unit is found in a naturally
occurring silk fibroin protein, which can be represented as
GAGAG(SGAGAG)₈SGAAGY (Y = tyrosine). In the subject invention, the
repeating unit is designed where the N-terminus may be MGAGAG or any other
sequence of generally at least about 3 amino acids, usually at least about 5 amino
acids, more usually 12 amino acids and not greater than 235, usually not greater
than 100 amino acids, which may be different from the repetitive unit. Generally, a
different N-terminus will be the result of insertion of the gene into a vector in a
manner that results in expression of a fusion protein. Any protein which does not
interfere with the desired properties of the product may provide the N-terminus.
Particularly, endogenous host proteins, e.g. bacterial proteins, may be employed.
The choice of protein may depend on the nature of the transcriptional initiation
region. Similarly, the C-terminus may have an amino acid sequence different from
the repeat sequence. Conveniently, there may be from 1 to 125, frequently 1 to
100, usually 1 to 25 amino acids, which may be the C-terminus of a naturally
occurring structural gene, which again typically results from the formation of a
fusion product.

A silk-like-protein (Slp) gene may be produced by providing oligomers of
from about 5 to 25 repeat units as described above, more usually of about 10 to 20
repeat units. By having different cohesive ends, the oligomers may be
concatemerized to provide for the polymer having 2 or more of the oligomeric units,
usually not more than about 50 oligomeric units, more usually not more than about
30 oligomeric units, and frequently not more than about 25 oligomeric units.

The silk-like proteins may be varied by having alternate multimers with the
same or different handedness. For example, in the formula, (B)p may provide an
even or odd number of amino acids. In silk, the hydrogens of the glycine may align
on one side and the methyls and hydroxyls of alanine and serine on the other. If
(B)p is even, there will be continuous alignment, if odd, there will be alternating
alignment of (A)n. Thus, different properties can be achieved by changing the
number of amino acids encoded by (B)p.

Of particular interest are polypeptides which mimic the composition and 
physical properties of silks found in nature, e.g. Bombyx mori. 

Also of interest are polypeptides which have as a base repeating unit GVGVP
(G = glycine, V = valine, P = proline), which may be found in naturally occurring
elastin; also VPGVG and/or APGVGV units, where again the choice of the initial
amino acid in the repeating unit is arbitrary. In the subject invention, the
N-terminus may be any convenient sequence and, if desired, may be in whole or in
part removed by a protease. Usually the N-terminal sequence which does not have
the subject motif will be less than about 125, frequently less than about 100 amino
acids, more usually less than about 60 amino acids.

Of particular interest is a base sequence of about 2 to 32, preferably 8, units
separated by a sequence of about 3 to 50 amino acids, usually 12 to 48 amino acids,
which may include an internal repeat of from 3 to 15 amino acids different from the
basic repeating unit. For example, the second repeat sequence could be GAGAGS,
repeated twice. The total number of base repeating units will generally be in the
range of about 150 to 500, more usually 150 to 300, and more usually 175 to 250.
The C-terminus may terminate with a repetitive unit or portion thereof or a different
sequence of from 1 to 125, usually 1 to 50 amino acids. The C-terminus is not
critical to the invention and will be selected primarily for convenience. As with the
N-terminus, it may be designed for proteolytic cleavage. As in the case of the silk
protein, the subject elastin-like protein may be similarly engineered.
Of particular interest are proteins which mimic the properties of elastin and
provide for elastomeric properties and the use of elastin blocks to impart different 
physical properties, e.g. elastic properties or to modify the solubility properties of a 
different repetitive unit polymer. 

Of particular interest are collagen-like proteins which have the sequence
Gαβ, where α and β may be any amino acid, particularly one being proline.
Usually in the protein α and β will be selected so that the total percent proline in the
protein is between about 10 to 45 number % of the amino acids in the protein. The
amino acids of particular interest other than glycine and proline are alanine,
isoleucine, leucine, valine, serine, threonine, asparagine, glutamine, lysine,
arginine, aspartic acid, glutamic acid, histidine. By known procedures after
production of the protein, one or more prolines may be oxidized to hydroxyproline.

Also of interest are the polypeptides which have as a repeating unit
K-L-(l)-L-a-E-A where l is a basic or acidic amino acid, particularly K or E and the
repeating units alternate as to whether l is a basic or acidic amino acid. This
structure is commonly found in keratin.

The copolymer involving repeating units is a powerful method for varying
properties, by appropriate choice of the different units, the number of units in each
multimer, the spacing between them, and the number of repeats of the multimer
combination assembly. Thus, by varying the number and arrangement of primary
monomers, a variety of different physical and chemical properties can be achieved.

Exemplary of the use of the block copolymers are combinations of silk units 
and elastin units to provide products having properties distinctive from polymers 
only having the same monomeric unit. 

The repetitive proteins can find a variety of uses. The Slp proteins may be
used in producing fibers having unique properties, as a substitute for silk, and the
like. Collagen proteins can be produced, where the collagen is free of the
telopeptide or contains the telopeptide, depending upon its function. Atelopeptide
collagen should have little if any immunogenicity, so as to be a useful structural
element for a variety of prosthetic devices or for use as a collagen substitute in other
applications. Similarly, other proteins having repetitive sequences, such as keratin,
can also be prepared in accordance with the subject invention. Other useful
repetitive proteins can be prepared based on sequences of spider silks and other
repetitive animal fibers. Artificial peptides useful for immunization could also be
prepared based on repeating sequences present in various surface antigens of
disease-causing microorganisms, such as parasites, bacteria, and viruses.

The following examples are offered by way of illustration and not by way of
limitation.



Example 1 

DNA Preparation Methods 
1. Preparation of plasmid DNA from E. coli:

A. Small scale: Plasmid DNA was prepared from 1.5 ml cultures by
either the boiling procedure or the alkaline lysis method (Maniatis et al., Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring
Harbor (1982)).

B. Large scale: A plasmid-carrying strain was grown overnight in 1 liter
of Luria broth with the appropriate antibiotic. The cells were collected by
centrifugation at 10,000xg for 5 min and resuspended in 10 ml of ice cold TE
(10 mM Tris-HCl pH 8, 1 mM EDTA). The cells were centrifuged again,
resuspended in 4 ml of TES (TE and 25% w/v sucrose) and homogenized by
vortexing. The samples were kept on ice for the following steps. Lysozyme (1 ml
of 10 mg/ml) was added to the cell suspension and incubated for 5 min before the
addition of 2 ml of 0.5 M EDTA pH 8. After 10 min incubation, 50 ml of
proteinase K (40 mg/ml) were added followed 10 min later with 15 ml of lysing
buffer (0.1% Triton X-100, 1 mM EDTA, 50 mM Tris-HCl pH 8). After 15-20 min,
the cell lysate was centrifuged at 35,000xg for 90-120 minutes. The supernatant
(19.8 ml) was transferred to a plastic tube with 20 mg of CsCl and 400 µl of
ethidium bromide (10 mg/ml). After dissolution, the mixture was divided into two
polyallomer ultracentrifuge tubes, sealed with heat and centrifuged in a Beckman Ti
65 rotor at 60,000 rpm for 24 hr. The lower plasmid DNA band was removed from
the tube with a hypodermic needle. The ethidium bromide was extracted three times
with an equal volume of NaCl-saturated isopropanol. Two volumes of H2O were
added to the DNA solution, and then the DNA was precipitated with ethanol.
2. Preparation of double-stranded DNA:

A culture of JM103 was grown to an OD600 of about 0.2 and then divided
into aliquots of 2 ml. Each aliquot was infected with a fresh plaque of M13 and
incubated at 37°C for about 6 hours with vigorous shaking. Then the cells were
pelleted and the supernatant was saved for subsequent infections. The
double-stranded phage DNA was extracted by the boiling method (Maniatis et al.).

3. Deproteinization: 

Phenol extraction was performed on a convenient volume of DNA sample,
typically between 100 µl to 10 ml. The DNA sample was diluted in 0.01 M
Tris-HCl pH 7.5, 1 mM EDTA and an equal volume of water-saturated phenol was
added. The sample was vortexed briefly and placed on ice for 3 minutes. After
centrifugation for 3 min in a microfuge, the aqueous layer was removed to a new
tube and extracted once with an equal volume of chloroform:isoamyl alcohol (24:1).

4. Ethanol precipitation: 

DNA in an aqueous buffer was concentrated by ethanol precipitation. To the
DNA sample was added 1/10 volume of 3 M sodium acetate pH 7.5 and 2-3
volumes of cold ethanol. The DNA was precipitated for 30 min at -70°C or
overnight at -20°C and then pelleted by centrifugation in the microfuge for 15 min
at 4°C. The pellet was washed once with 200 µl of cold 80% ethanol and pelleted
again for 10 min at 4°C. After air drying or lyophilization, the pellets were
resuspended in the appropriate buffer.
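
The volumes used in the precipitation step follow directly from the sample volume, as the sketch below shows. It assumes that both the 1/10 volume of sodium acetate and the 2-3 volumes of ethanol are reckoned against the original sample volume, which the procedure does not state explicitly; the function name and the default of 2.5 volumes are hypothetical.

```python
# Illustrative sketch only; assumes all volumes are relative to the sample.
def ethanol_precipitation_volumes(sample_ul, ethanol_fold=2.5):
    """Return the reagent volumes (in µl) for a given sample volume."""
    if not 2.0 <= ethanol_fold <= 3.0:
        raise ValueError("ethanol_fold should lie between 2 and 3 volumes")
    return {
        "sodium_acetate_3M_ul": sample_ul / 10.0,
        "cold_ethanol_ul": sample_ul * ethanol_fold,
    }


if __name__ == "__main__":
    print(ethanol_precipitation_volumes(100.0))
```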

5. Phosphatase treatment of DNA:

A. Phosphatase treatment of DNA was performed by adding 1 µl (25
units) of calf intestinal phosphatase (Boehringer Mannheim) directly to the
restriction enzyme digestion reaction and continuing the incubation for 30 minutes at
37°C. The phosphatase was inactivated for 60 min at 65°C prior to deproteinization
by phenol extraction.

B. Phosphatase treatment of DNA was also performed by resuspending
ethanol precipitated DNA from the restriction enzyme digest in 20 mM Tris-HCl pH
8.0, 10 mM MgCl2 to a final DNA concentration of 20 µg/ml. Shrimp alkaline
phosphatase ("SAP") was added at 2 U/µg of DNA and the mixture was incubated at
37°C for one hour, heat inactivated for 20 min at 65°C and then passed through a
Probind filter (Millipore) and subsequently a Bio-Spin column. The DNA was then
ethanol precipitated and resuspended in suitable buffer.

6. Phosphorylation of DNA:

Phosphorylation before annealing was performed by using Polynucleotide
Kinase 3'-phosphatase-free (Boehringer Mannheim). The reaction was carried out at
37°C for 30 min in a 50 µl reaction volume containing: 12.5 µg DNA, 5 µl 10x
kinase buffer (0.5 M Tris pH 7.5, 10 mM Spermidine, 0.1 M MgCl2, 150 mM
DTT, 1 mM EDTA), and 2 µl Polynucleotide Kinase (10 U/µl). After
phosphorylation, salts and glycerol were removed from the DNA strands using a
Bio-Spin 6 column (BioRad) equilibrated in TEAB.
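
The working concentrations in the kinase reaction follow from diluting 5 µl of the 10x buffer into the 50 µl reaction. The sketch below performs that dilution for the buffer components listed above; the dictionary and function names are hypothetical.

```python
# Illustrative sketch only; concentrations of the 10x stock are in mM.
KINASE_BUFFER_10X_MM = {
    "Tris pH 7.5": 500.0,
    "spermidine": 10.0,
    "MgCl2": 100.0,
    "DTT": 150.0,
    "EDTA": 1.0,
}


def final_concentrations(stock_mm, stock_volume_ul, reaction_volume_ul):
    """Dilute each stock component into the final reaction volume."""
    factor = stock_volume_ul / reaction_volume_ul
    return {name: conc * factor for name, conc in stock_mm.items()}


if __name__ == "__main__":
    final = final_concentrations(KINASE_BUFFER_10X_MM, 5.0, 50.0)
    for name, conc in final.items():
        print(f"{name}: {conc:g} mM")
```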


7. Fill-in reaction with DNA polymerase I:

DNA was resuspended in buffer containing 50 mM Tris-HCl pH 7.4, 50 mM
KCl, 5 mM MgCl2, and 400 µM each of the four deoxynucleotide triphosphates.
Ten units of Klenow DNA polymerase (BRL) were added, and the reaction was
allowed to proceed for 15 min at room temperature. The DNA was then phenol
extracted and ethanol precipitated.

8. T4 polynucleotide kinase reaction: 

The reaction (10 µl) contained: T4 polynucleotide kinase (BRL), 150 ng of
DNA, 1 µl of 10x kinase buffer (0.7 M Tris-HCl pH 7.6, 0.1 M MgCl2, 50 mM
DTT) and [32P]-ATP (200-300 nCi). This was incubated at 37°C for 30 minutes and
then the DNA was purified using a NACS column (Bethesda Research Labs).

9. Digestion with restriction endonucleases: 

DNA was digested with restriction endonucleases (REN) in 1x "AA" buffer
[10x AA buffer is 330 mM Tris-acetate, pH 7.9, 660 mM potassium acetate, 100
mM magnesium acetate, 50 mM dithiothreitol (DTT) and 1 mg/ml bovine serum
albumin (nuclease free)]. Whenever possible, the concentration of DNA was kept
below 1 µg/25 µl. Incubation was at 37°C for 1-4 hrs for most restriction
endonucleases except for BalI, BanI and NaeI digestions which were incubated
overnight.
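
The guideline of keeping the DNA below 1 µg per 25 µl fixes a lower bound on the digest volume, and the 10x AA buffer is then one tenth of that volume. A minimal sketch of the arithmetic follows; the function names are hypothetical, and the incubation lookup merely restates the exceptions named above.

```python
# Illustrative sketch only; restates the digestion guidelines as arithmetic.
OVERNIGHT_ENZYMES = {"BalI", "BanI", "NaeI"}


def minimum_digest_volume_ul(dna_ug):
    """Smallest reaction volume keeping DNA at or below 1 µg per 25 µl."""
    return dna_ug * 25.0


def buffer_10x_volume_ul(reaction_volume_ul):
    """Volume of 10x AA buffer giving 1x in the final reaction."""
    return reaction_volume_ul / 10.0


def incubation_time(enzyme):
    """Return the incubation period stated for the given enzyme."""
    return "overnight" if enzyme in OVERNIGHT_ENZYMES else "1-4 hr"


if __name__ == "__main__":
    volume = minimum_digest_volume_ul(2.0)
    print(volume, buffer_10x_volume_ul(volume), incubation_time("BanI"))
```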


10. Analytical agarose gel electrophoresis of DNA:

To DNA samples for gel analysis we added 0.2 volumes of loading buffer (5x
electrophoresis buffer, 0.01% bromphenol blue dye, 50 mM EDTA, and 50%
glycerol). Then the samples were loaded into lanes of a horizontal submerged
electrophoresis unit containing a 1.0% (w/v) agarose gel. The electrophoresis buffer
was either 1x TAC or 1/2x TBE. The 1x TAC is 40 mM Tris-base, 10 mM
EDTA, adjusted to pH 7.8 with acetic acid. The 1/2x TBE is 0.045 M Tris-base,
0.045 M boric acid, 1 mM EDTA, pH 8. The gel was run at 40-50 V for 18 hr, then
removed and stained with 0.5 µg/ml ethidium bromide for 30 minutes. The DNA
bands were visualized on a long wavelength UV transilluminator.

11. Preparative agarose gel electrophoresis: 

The procedures and materials are the same as for the analytical agarose gel
electrophoresis. The only difference is the use of low melting point ("LMP")
agarose, ranging in concentration from 0.5 to 2.5% (w/v) depending on the size of
the DNA fragment to be purified. DNA restriction fragments were excised from the
LMP agarose gels after visualization with ethidium bromide. For agarose ligation,
the buffer used was 1x TAE (50 mM Tris-acetate, pH 7.8).

12. NACS purification:

Gel fragments containing DNA were melted at 70°C for 5 min and diluted
approximately 5 fold with TE1 (10 mM Tris-HCl pH 7.5, 0.2 M NaCl). The gel
solution was applied to a NACS column (BRL). The column was washed with 5 ml
of the same buffer. The bound DNA was eluted with 300 µl of either TE2 (10 mM
Tris-HCl pH 7.5, 1.0 M NaCl) for DNA fragments smaller than 1000 bp or TE3
(10 mM Tris-HCl pH 7.5, 2 M NaCl) for larger fragments. The eluted DNA was
concentrated by ethanol precipitation.
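
The choice between the two elution buffers depends only on fragment size and may be expressed as a simple rule. In the sketch below, a fragment of exactly 1000 bp is assigned to TE3; that boundary treatment is an assumption, since the procedure speaks only of fragments smaller than 1000 bp and of larger fragments. The function name is hypothetical.

```python
# Illustrative sketch only; the handling of exactly 1000 bp is an assumption.
def nacs_elution_buffer(fragment_bp):
    """Pick the NACS elution buffer according to fragment length."""
    if fragment_bp <= 0:
        raise ValueError("fragment length must be positive")
    if fragment_bp < 1000:
        return "TE2 (10 mM Tris-HCl pH 7.5, 1.0 M NaCl)"
    return "TE3 (10 mM Tris-HCl pH 7.5, 2 M NaCl)"


if __name__ == "__main__":
    for size in (500, 3000):
        print(size, nacs_elution_buffer(size))
```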
13. DNA ligation:

Reactions for ligating cohesive ends contained: 1 µg DNA, 1x AA buffer
(see step 9, above), 1 mM ATP and 20 units of T4 DNA ligase (BRL) in a 20 µl
final reaction volume. The ligation was allowed to proceed for 16-18 hr at 15°C or
1-2 hr at room temperature. For blunt-ended ligations the reactions contained 1 µg
DNA, 25 mM Tris-HCl pH 7.5, 5 mM MgCl2, 5 mM DTT, 0.25 mM spermidine,
200 mg BSA, 1 mM hexamine cobalt chloride (HCC), 0.5 mM ATP and 400 units
T4 DNA ligase (NEB) in a 20 µl reaction volume. The ligation was allowed to
proceed for 30 min to 1 hr at room temperature.


14. Agarose DNA Ligation:

The agarose was melted at 65°C, the temperature was then lowered to 37°C
and ligation buffer (5x = 100 mM Tris-HCl, pH 7.5, 50 mM MgCl2, 50 mM DTT,
1 mM ATP) was added; the tube was then placed at room temperature and ligase
was added (1000 units T4 DNA ligase (NEB)); the reaction volume was usually 50
µl. The reaction was incubated at 15°C for 16-18 hours.

15. Use of Filters and Columns for DNA Purification:

A. Ultrafree®-Probind filter unit ("Probind", Millipore): the DNA
containing solution was applied to the filter unit and spun at 12,000 RPM for 30
seconds in a Sorvall Microspin 24S.

B. Microcon-30 filter (Amicon): the DNA containing solution was
washed by applying to the filter and exchanging twice with H2O by spinning at
12,000 RPM for 6 min in a microfuge.

C. Bio-Spin 6 column ("Bio-Spin", BioRad): Salts and glycerol were
removed from the DNA solution by applying to the column, previously equilibrated
in TEAB (triethyl ammonium bicarbonate pH 7.0), and spinning in a Sorvall RC5B
centrifuge using an HB4 rotor at 2,500 RPM for 4 min.

16. Agarose DNA Purification Using Ultrafree®-MC Filter Unit:

This procedure can be used for agarose slices up to 400 µl in size. After 
agarose gel electrophoresis the DNA is visualized by ethidium bromide staining and 
the agarose block containing the DNA band of interest is excised. The agarose is 
then frozen at -20°C for 1 hour, then quickly thawed at 37°C for 5 minutes. The 
agarose is then thoroughly macerated. The pieces are then transferred into the 
sample cup of the filter unit and spun at 5,000xg in a standard microfuge for 20 
min. The agarose is then resuspended in 200 µl of Tris-EDTA, or other buffer, 
and incubated at room temperature for 30 min to allow for elution of additional 
DNA from the gel. The mixture is then centrifuged for an additional 20 min at 
10,000 RPM. The DNA is, at this point, in the filtrate tube separated from the 
agarose fragments and ready for subsequent DNA manipulations. 
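
The procedure above gives one spin as a relative centrifugal force (5,000xg) and the next as a rotor speed (10,000 RPM). The two are related by the standard formula RCF = 1.118 x 10^-5 x r x N^2, where r is the rotor radius in cm and N is the speed in RPM. The following Python sketch shows the conversion; the 7 cm radius is an assumed, illustrative value, because the rotor radius is not stated here.

```python
import math

RCF_CONSTANT = 1.118e-5  # standard factor for radius in cm and speed in RPM

def rpm_to_rcf(rpm, radius_cm):
    """Relative centrifugal force (x g) at a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2

def rcf_to_rpm(rcf, radius_cm):
    """Rotor speed (RPM) needed to reach a given relative centrifugal force."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))

ASSUMED_RADIUS_CM = 7.0  # hypothetical microfuge rotor radius, not from the text

print(f"10,000 RPM ~ {rpm_to_rcf(10_000, ASSUMED_RADIUS_CM):.0f} x g")
print(f"5,000 x g  ~ {rcf_to_rpm(5_000, ASSUMED_RADIUS_CM):.0f} RPM")
```

With a different rotor the numbers change in proportion to the radius, so the actual rotor dimensions should be substituted before relying on the result.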

Bacterial Transformation Methods 

1. Preparation of transformation-competent E. coli cells:

A culture of 200 ml of sterile L broth was inoculated with a small loopful of 
E. coli cells. This was incubated with shaking at 37°C until the OD600 was 
approximately 0.5. The culture was placed on ice for 10 min and centrifuged at 
6,000xg for 10 minutes. The cell pellet was resuspended in 100 ml of ice-cold 
0.1 M MgCl2, kept on ice for 30-40 min and centrifuged again. The pellet was 
resuspended in 2 ml of ice-cold 100 mM CaCl2, transferred to a sterile test tube and 
incubated on ice for 24 hours. The competent cells were then aliquoted and stored 
at -70°C. 

2. Transformation of E. coli:

An aliquot of frozen competent cells was thawed on ice. To 50 µl of cells 
0.1 to 1 µg of DNA was added and the mixture was incubated on ice for 30 minutes. 
The tube was removed from ice and placed in a 42°C bath for 2 minutes. L broth 
(1 ml) was added and the transformation mix incubated with shaking at the desired 
temperature (usually 30°C or 37°C) for 2 hours. Then one-tenth of the 
transformation was plated on L broth plates containing the appropriate antibiotic 
and, when necessary, XGAL and IPTG were added. 

3. DNA transformation of B. subtilis:

B. subtilis cells were grown to early stationary phase (change in Klett units of 
≤5% in 15 min). Transformation followed established procedures (Anagnostopoulos 
et al., 1981) (ref. 8). Cells (0.45 ml) were incubated with 1-10 µg of DNA at 37°C 
for 80 minutes with shaking, and then plated on TBAB agar plates with an 
appropriate antibiotic. 

4. Isolation of plasmid DNA from B. subtilis:

Plasmid DNA from B. subtilis was obtained by a method similar to the 
alkaline-lysis method except that pelleted cells were resuspended in 8 ml of solution 
1 (50 mM glucose, 10 mM EDTA, 25 mM Tris-HCl (pH 8.0), 10 mg/ml lysozyme) 
and incubated at room temperature for 30 minutes. Then 16 ml of solution 2 (0.2 N 
NaOH, 1% (w/v) SDS) was added and incubated on ice for 10 minutes. Finally, 12 
ml of 3 M potassium acetate (pH 4.8) was added and incubated an additional 20 min 
on ice. The lysed cells were centrifuged 15 min at 15,000 rpm in a Sorvall SS-34 
rotor. The DNA was precipitated by adding an equal volume of isopropyl alcohol 
and centrifuged at 7,000 rpm. The pellet was resuspended in 5 ml of 10 mM 
Tris-HCl (pH 7.5), 1 mM EDTA (TE). The solution was phenol extracted once and 
chloroform extracted. DNA was precipitated with ethanol and resuspended in 3 ml 
of TE. The volume was adjusted to 5.2 ml by adding 4.2 g CsCl, 400 µl of ethidium 
bromide at 10 mg/ml and TE. The solution was transferred to a Beckman Quick-Seal 
polyallomer centrifuge tube and centrifuged at 45,000 rpm in a Beckman VTi65 rotor 
for 18 hours. 

Antibody Production, Protein Chemistry and Electrophoresis of Proteins 

1. Preparation of antibody to artificially synthesized peptides:

Synthetic peptide of sequence (GAGAGS)8GAAGY was coupled to BSA 
using the glutaraldehyde procedure of Kagen and Glick (1979). The degree of 
coupling was monitored using trace amounts of radioactive iodinated synthetic 
peptide. 

A peptide of 53 amino acids corresponding to the SlpIII sequence was 
prepared on an Applied Biosystems peptide synthesizer. The yield of this material, 
which has a molecular weight of 3640, was approximately 0.5 grams. The peptide 
was coupled to bovine serum albumin. The material was sent to Antibodies, Inc. for 
preparation of antibodies in rabbits. Peptide conjugates at a concentration of 
1 mg/ml in complete Freund's adjuvant were used to immunize rabbits at day 0. 
Animals were re-injected with antigen in Freund's incomplete adjuvant at day 30 
and titered at day 60. Positive sera were detected using a microtiter RIA using the 
synthetic peptide as antigen (Kagen and Glick (1979), in Methods of 
Radioimmunoassay, Jaffe and Berman (eds.), Academic Press, p 328). Antisera were 
obtained that reacted with synthetic peptides of both the SlpI and SlpIII sequences. 
These antisera have been useful for the detection of fusion peptides containing 
gly-ala (SLP) sequences. 
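
The stated molecular weight can be checked from the sequence. The sketch below assumes that the 53-residue peptide is the (GAGAGS)8GAAGY sequence given at the start of this section (48 + 5 = 53 residues); that identification is an inference, not an explicit statement of the text. The average residue masses are standard textbook values and do not come from this document.

```python
# Average residue masses in daltons (standard values, not from the source).
RESIDUE_MASS = {"G": 57.0519, "A": 71.0788, "S": 87.0782, "Y": 163.1760}
WATER_MASS = 18.0153  # one water is added for the free N- and C-termini

def peptide_mass(sequence):
    """Average molecular mass of a linear, unmodified peptide."""
    return sum(RESIDUE_MASS[residue] for residue in sequence) + WATER_MASS

peptide = "GAGAGS" * 8 + "GAAGY"  # assumed identity of the 53-residue peptide
print(len(peptide), "residues")
print(f"{peptide_mass(peptide):.1f} Da")
```

Under these assumptions the computed mass is about 3640.6 Da, which agrees with the molecular weight of 3640 quoted above.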

Following the procedure described above an antigen was synthesized having 
the formula (V-P-G-V-G)8, which was coupled to keyhole limpet hemocyanin. 
Polyclonal antisera were then prepared as described above which bound to the ELP 
peptide. 

Following the same procedure, additional antigens were synthesized having 
the formula YTITVYAVTGRGDSPASSKPISINYC of fibronectin (the FCB portion) 
and the formula (GAPGAPGSQGAPGLQ)2YMK (a repeat unit of the collagen-like 
protein (CLP) sequence) which were coupled to keyhole limpet hemocyanin for use 
as immunogens. Polyclonal antisera were then prepared as described above which 
bound, respectively, to the FCB peptide, and to the synthetic peptide of the CLP 3.7 
sequence and PPAS sequence described below. 

2. Polyacrylamide gel electrophoresis of proteins:

Approximately 10^9 E. coli cells from growing cultures were pelleted by 
centrifugation at 10,000xg for 5 minutes. The cell pellets were resuspended in 100 
to 500 µl of 2X sample buffer (100 mM Tris-HCl pH 6.8, 4% SDS, 10% 
β-mercaptoethanol, 60% glycerol or sucrose) and sonicated for 30 sec using a Tekmar 
sonic disrupter. Samples were boiled for approximately 5 min and 20 to 100 µl of 
the cell lysates were loaded on an SDS-polyacrylamide gel (7.5 to 16% w/v). The 
gels were prepared following the procedure of Laemmli, Nature (1970) 227:680-685. 
The proteins in the gels were stained with 2% Coomassie brilliant blue in 10% 
methanol, 7.5% acetic acid for 1 hr and destained in 10% methanol, 7.5% acetic 
acid overnight. 

3. Protein expression analysis:

An overnight culture which had been grown at 30°C was used to inoculate 50 
ml of the LB media contained in a 250 ml flask. Kanamycin was added at a final 
concentration of 50 µg/ml and the culture was incubated with agitation (200 RPM) 
at 30°C. When the culture reached an OD600 of 0.8, 40 ml were transferred to a 
new flask prewarmed at 42°C and incubated at the same temperature for 
approximately 2 hours. The cultures (30°C and 42°C) were chilled on ice and 
the OD600 was taken. Cells were collected by centrifugation and then divided in 1.0 
OD600 aliquots and used to perform western analysis using the appropriate 
antibodies. 
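
Dividing a culture into 1.0 OD600 aliquots amounts to a simple normalization. The sketch below assumes that a "1.0 OD600 aliquot" means the culture volume whose OD600 multiplied by its volume in ml equals 1.0; this reading, and the function name, are assumptions made for illustration.

```python
def volume_for_od_units(od600, od_units=1.0):
    """Culture volume (ml) containing the requested number of OD600 units."""
    if od600 <= 0:
        raise ValueError("OD600 must be positive")
    return od_units / od600

# Hypothetical readings for the 30°C and 42°C cultures (not from the source).
for label, reading in (("30°C", 2.0), ("42°C", 1.6)):
    print(f"{label}: take {volume_for_od_units(reading):.3f} ml per aliquot")
```

Loading equal OD units per lane keeps the amount of cell material comparable between the induced and uninduced samples on the western blot.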

4. Immunoblotting of proteins in gels:

After protein electrophoresis, one of the flanking glass plates was removed 
from the polyacrylamide gel. The gel surface was wetted with transfer buffer (25 
mM Tris-HCl, 192 mM glycine, 20% methanol). A piece of nitrocellulose paper 
(Sartorius, SM11307) was saturated with transfer buffer and laid on the gel. Air 
bubbles between the filter and the gel were removed. The gel and nitrocellulose 
filter were placed in the transfer unit as specified by the manufacturer (BioRad). 
Transfer was allowed to proceed at 200 mA for 3-4 hours. Then the nitrocellulose 
filter was removed and stained with Amido-Schwartz for 3 min (0.05% Amido 
black, 45% deionized H2O, 45% methanol, 10% acetic acid) and destained in H2O. 
The filter was incubated for at least 10 min at room temperature in "BLOTTO" (5% 
w/v nonfat dry milk, 50 mM Tris-HCl pH 7.4, 0.9% w/v NaCl, 0.2% w/v sodium 
azide). The filter was placed in serum appropriately diluted (1:50 to 1:500) in 
0.5X BLOTTO (2.5% nonfat dry milk, 50 mM Tris-HCl pH 7.4, 0.9% NaCl, 0.2% 
sodium azide) and was gently agitated for approximately 16 hr at room temperature. 
The filter was washed for 1 hr with 5 changes of TSA (50 mM Tris-HCl pH 7.4, 
0.9% NaCl, 0.2% sodium azide). The blot was placed in 15 ml of 0.5X BLOTTO 
solution containing 1x10^7 cpm of 125I-protein A and gently agitated for 2 hr at 
room temperature. The filter was washed for 2 hr with a minimum of 7 changes of 
TSA, rinsed once with deionized H2O and air dried. The blot was covered with 
Saran wrap and autoradiographed. 

An alternative to the 125I-Protein A detection method was also used. This 
method relied on a chemiluminescent signal activated by horseradish peroxidase 
(HRP). The chemiluminescent reagents are readily available from several suppliers 
such as Amersham and DuPont NEN. The western blot was prepared and blocked 
with BLOTTO. A number of methods were used to introduce the HRP reporter 
enzyme including, for example, a hapten/anti-hapten-HRP, a biotinylated 
antibody/streptavidin-HRP, a secondary reporter such as a goat or mouse anti-rabbit 
IgG-biotinylated/streptavidin-HRP, or a goat or mouse anti-rabbit IgG-HRP. These 
reagents were bought from different sources such as BioRad or Amersham and 
occasionally biotinylated antibodies were prepared in our laboratory using Biotin 
NHS from Vector Laboratories, Burlingame, CA (Cat. #SP-1200) following the 
procedure accompanying the product. The following is an example of a procedure 
used to detect the expression of protein polymers. 

The blot was placed in 15 ml of BLOTTO solution containing biotinylated 
goat anti-rabbit IgG (BioRad) diluted in BLOTTO (1:7500) and gently agitated for 2 
hr at room temperature. The filter was then washed for 30 minutes with 3 changes 
of TSA (50 mM Tris-HCl pH 7.4, 0.9% NaCl, 0.2% sodium azide). The blot was 
then incubated for 20 minutes at room temperature with gentle rotation in 20 ml of 
HRP-Streptavidin (Amersham) diluted 1:1000 in TBS (100 mM Tris Base, 150 mM 
NaCl, pH 7.5) with 0.1% Tween 20. The blot was then washed three times 
for 5 minutes each in TBS with 0.3% Tween 20 and then three times for 5 minutes 
each in TBS with 0.1% Tween 20. The blot was then incubated for 1 minute with 
gentle agitation in 12 ml of development solutions #1 and #2 (Amersham) equally 
mixed. The blot was removed from the development solution and autoradiographed. 

5. Amino Acid Analysis:

Amino acid compositions were determined by the PTC derivatization procedure 
of Henrickson and Meredith (1984). Protein samples were hydrolysed with 5.7 N 
constant boiling HCl at 108°C for 24 hours in vacuo. After reaction with PITC, 
amino acid derivatives were detected at 254 nm by HPLC reverse phase 
chromatography using a Hewlett Packard 1090 or Waters 600E system and a 
Supelco C18 column (4.6 mm x 25 cm) with a linear gradient of 0-50% acetonitrile 
in 0.1 M NH4OAc pH 6.78 as the mobile phase. Henrickson, R.L. and Meredith, S.C. 
(1984) Amino Acid Analysis by Reverse Phase High Performance Liquid 
Chromatography. Anal. Biochem. 137:65-74. 

6. Amino Acid Sequence Analysis: 

The N-terminal amino acid sequence was determined by automated Edman 
degradation using an Applied Biosystems Model 470A gas phase protein sequenator. 
The PTH amino acid derivatives were analyzed by reverse phase HPLC using a 
Hewlett Packard 1090 or Waters 600E system and an Altex C18 column (2 mm x 25 
cm) with a complex gradient buffer system. 

7. Peptide Synthesis:

Synthetic peptides were prepared by solid phase synthesis on an Applied 
Biosystems Model 430A Peptide Synthesizer using the standard symmetric anhydride 
chemistry as provided by the manufacturer. The coupling yield at each step was 
determined by the quantitative ninhydrin procedure of Sarin et al. (1981). The 
synthetic peptide was cleaved from the solid support and amino acid blocking groups 
were removed using anhydrous HF (Stewart and Young, 1984). Crude peptides 
were desalted by chromatography over Sephadex G-50. Sarin, V.K., Kent, S.B.H., 
Tam, J.P. and Merrifield, R.B. (1981). Anal. Biochem. 237:927-936. Stewart, 
J.M. and Young, J.D. (1984). Solid Phase Peptide Synthesis, Pierce Chemical 
Company, Rockford, IL. pp 85-89. 

Synthetic DNA Methods 

1. In vitro DNA synthesis:

The N,N-diisopropylphosphoramidites or β-cyanoethylphosphoramidites, 
controlled-pore glass columns and all synthesis reagents were obtained from Applied 
Biosystems, Foster City, California. 

Synthetic oligonucleotides were prepared by the phosphite triester method 
with an Applied Biosystems Model 380A or 381A DNA synthesizer using a 10-fold 
excess of protected phosphoramidites and 0.2 or 1 µmole of nucleotide bound to the 
synthesis support column. The chemistries used for synthesis are the standard 
protocols recommended for use with the synthesizer and have been described 
(Matteucci, et al., Journal Amer. Chem. Soc., 103:3185-3319 (1981)). 
Deprotection and cleavage of the oligomers from the solid support were performed 
according to standard procedures as described by McBride, et al., Tetrahedron 
Letters, 24:245-248 (1983) and as provided by Applied Biosystems. The repetitive 
yield of the synthesis as measured by the optical density of the removed protecting 
group as recommended by Applied Biosystems (1984) was greater than 97.5%. 
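
The repetitive (per-coupling) yield determines how much full-length product survives a synthesis. If each coupling succeeds with probability y, an oligomer of n bases requires n - 1 couplings onto the support-bound nucleotide, so the expected full-length fraction is y^(n-1). The Python sketch below illustrates this with the 97.5% lower bound quoted above; the function name is hypothetical and the model ignores losses during deprotection and purification.

```python
def full_length_fraction(stepwise_yield, length):
    """Expected fraction of full-length oligomer after (length - 1) couplings."""
    if not 0.0 < stepwise_yield <= 1.0:
        raise ValueError("stepwise yield must be in (0, 1]")
    if length < 1:
        raise ValueError("length must be at least 1")
    return stepwise_yield ** (length - 1)

for n in (18, 30, 100):
    print(f"{n:>3}-mer at 97.5% per step: {full_length_fraction(0.975, n):.1%}")
```

At 97.5% per step an 18-base oligomer is roughly 65% full length, whereas a 100-base oligomer falls to roughly 8%, which is why long syntheses are especially sensitive to reagent quality and coupling efficiency.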

The crude oligonucleotide mixture was purified by preparative gel 
electrophoresis as described by the Applied Biosystems protocols of November 9, 
1984 (User Bulletin No. 13) and as updated in the Applied Biosystems protocols in 
Evaluating and Isolating Synthetic Oligonucleotides, 1992. The acrylamide gel 
concentration varied from 10 to 20% depending upon the length of the oligomer. 
The purified oligomer was identified by UV shadowing, excised from the gel and 
extracted by the crush and soak procedure (Smith, Methods in Enzymology, 
65:371-379 (1980)). 

For DNA synthesis of oligonucleotides longer than 100 bases, the synthesis 
cycle was changed from the protocol recommended by Applied Biosystems for the 
381A DNA synthesizer. All the reagents used were fresh. All the reagents were 
supplied by Applied Biosystems except for the acetonitrile (Burdick and Jackson 
Cat#017-4 with water content less than 0.001%) and the 2000 Å pore size column 
(Glen Research). Due to the length of the oligo, interrupt pauses had to be inserted 
during the synthesis to allow changing the reagent bottles that emptied during 
synthesis. This interrupt pause was done at the cycle entry step and the pause was 
kept as short as possible. The washes after detritylation by TCA, through the 
beginning of each synthesis cycle, were increased from about 2x to 3x over the 
recommended time. The time allocated for the capping was also increased to limit 
truncated failure sequences. After the synthesis the deprotection was done at 55°C 
for 6 hours. After desalting, the synthesized DNA was amplified using PCR. 

2. Sequencing of DNA:

DNA sequences were determined by the following methods. Fragments 
containing the region of interest were cloned into the multiple cloning site of 
M13mp18 or M13mp19 and single-stranded DNA was prepared and sequenced by 
the primer extension method as described in the literature (Sanger et al., 1977; 
Maniatis et al., 1982; Norrander et al., 1983, Gene 26:101-106; Sanger et al., 1977, 
Proc. Natl. Acad. Sci. USA 74:5463-5467; Biggin, 1983, Proc. Natl. Acad. Sci. USA 
80:3963-3965; Sanger et al., 1978, FEBS Letters 87:107-110) using 
35S-deoxyadenosine 5'-(alpha-thio)-triphosphate (New England Nuclear) as label. In 
some cases, reverse transcriptase (Molecular Genetics) was used to extend the 
primer, using the dideoxy:deoxynucleoside triphosphate ratios utilized by Zagursky 
et al., Gene Anal. Techn. (1985) 2:89-94. 

Deoxyadenosine triphosphate labeled with either 32P or 35S was used in these 
reactions. Compression artifacts which appeared in some G-C rich sequences were 
overcome by eliminating deoxyguanosine triphosphate from the G reaction, and 
using deoxyinosine triphosphate (P-L Biochemicals) at a final concentration of 37.5 
µM instead. In the other mixes, the concentration of dideoxyGTP in the G reaction 
was 0.5 mM. All sequences were run on 6 or 8% polyacrylamide gels containing 8 
M urea (Sanger et al. 1978). Primers used for sequencing were purchased from P-L 
Biochemicals. Storage and analysis of data utilized software from both DNAstar 
and International Biotechnologies, Inc. for IBM personal computer and DNA Strider, 
DNA Inspector IIe or DNAid for Apple Macintosh personal computer. 

3. In vitro mutagenesis of cloned DNA:

Plasmid DNA (1 µg) containing the sequence to be mutated was digested in 
two separate reactions. One reaction contained either one or two restriction 
endonucleases which cleave at sites immediately flanking the region of interest. In 
the second reaction, the DNA was digested with a restriction endonuclease which 
cleaves only once at a site distant from the sequence to be mutated. The DNA 
fragments generated in the first reaction were separated by agarose gel 
electrophoresis and the large fragment which lacks the sequence to be mutated was 
excised and purified. DNA from the second reaction, the large fragment of DNA 
from the first reaction, and a synthetic oligodeoxynucleotide of 20-30 bases in length 
containing the mutant sequence were mixed in a molar ratio of 1:1:250. The 
mixture was denatured by heating at 100°C for 3 min in 25 to 100 µl of 100 mM 
NaCl, 6.5 mM Tris-HCl pH 7.5, 8 mM MgCl2, and 1 mM β-mercaptoethanol. The 
denatured mixture was reannealed by gradually lowering the temperature as follows: 
37°C for 30 min, 4°C for 30 min, and 0°C for 10 minutes. The reaction was 
supplemented with 0.5 mM deoxyribonucleotide triphosphates, 1 mM ATP, 400 
units of T4 DNA ligase and 5 units of E. coli DNA polymerase large fragment and 
incubated at 15°C for 12-16 hours. The reaction mixture was then transformed into 
E. coli and antibiotic-resistant colonies were selected. 



4. Dideoxy DNA Sequencing of Double Stranded Plasmid DNA:

Plasmid DNA was prepared as described previously (Preparation of plasmid 
DNA from E. coli, Small Scale, Maniatis et al.). Primers were synthesized using a 
DNA synthesizer as described previously, and were annealed to the plasmid DNA 
following the procedure described above for M13 sequencing. The sequencing 
reactions were done using Sequenase (United States Biochemicals) and the 
conditions were as recommended by the supplier. All sequences were run on 
polyacrylamide gels as described above. 

5. PCR amplification:

The PCR reaction was performed in a 100 µl volume in a Perkin Elmer 
thin-walled GeneAmp™ reaction tube. Approximately 1 µl of each primer DNA was 
added to 1x PCR buffer (supplied by Perkin Elmer as 10x solution), 200 µM of each 
dNTP, 5 U AmpliTaq, and several concentrations of the target DNA. Amplification 
was performed in a Perkin Elmer DNA Thermal Cycler model 480 for 30 cycles 
with the following step cycles of 12 min each: 95°C, 62°C, and 72°C. Aliquots 
from the different reactions were analyzed by agarose gel electrophoresis using 
1.5% low melting point agarose in 0.5x Ta buffer. The reaction mixtures that 
gave the desired band were pooled and spun through an Ultrafree-Probind filter unit 
(Millipore) at 12,000 RPM for 30 seconds in a Sorvall Microspin 24S to remove the 
AmpliTaq enzyme. The buffer was then exchanged with H2O two times, using a 
Microcon-30 filter (Amicon) by spinning at 12,000 RPM for 6 min in a microfuge. 
Salts and glycerol were removed from the amplified dsDNA using a Bio-Spin 6 
column (from BioRad) equilibrated in TEAB, in a Sorvall RC5B centrifuge using an 
HB4 rotor at 2,500 RPM for 4 minutes. The DNA was then concentrated in vacuo. 

Fermentation Conditions 

The fermentor is a 15 L Chemap, 10 L working volume. The culture 
conditions are: temperature = 30°C, pH 6.8; 2.5 M NaOH is used for pH 
regulation. The headspace pressure is below 0.1 bar. The dissolved oxygen is 
regulated at 50%. The air flow varies from 0.5 L/min to 20 L/min. The 
agitation rate varies between 200 and 1500 rpm. The fermentor is inoculated with a 
10% (v/v) inoculum grown in medium A for 15 hours at 30°C under agitation. 

Medium B, C or D was the fermentor medium. The starting volume in the 
case of a 10 liter fermentation is no less than 3 L, and in the case of a 1 liter 
fermentation is no less than 0.5 liters. 

If the fermentor starting volume is less than the final volume desired, then 
when the carbon source concentration reaches 1%, a concentrated solution (5x) of 
medium B, C, or D, respectively, is added to the fermentor in order to keep the 
carbon source concentration at approximately 1%. 
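
The feed addition described above follows from a mass balance on the carbon source. If the culture volume is V with carbon concentration c, and a feed of concentration c_f is added to restore a target concentration c_t, the required feed volume is v = V(c_t - c)/(c_f - c_t). The Python sketch below assumes that 1% means 10 g/L (w/v) and that the 5x feed carries 100 g/L of carbon source, that is, five times the 20 g/L listed in the medium table; both figures are assumptions made for illustration, and consumption during the addition is ignored.

```python
def feed_volume_l(culture_volume_l, current_g_per_l, target_g_per_l, feed_g_per_l):
    """Feed volume (L) that brings the carbon source back up to the target."""
    if feed_g_per_l <= target_g_per_l:
        raise ValueError("feed must be more concentrated than the target")
    deficit = target_g_per_l - current_g_per_l
    if deficit <= 0:
        return 0.0  # already at or above the target, nothing to add
    return culture_volume_l * deficit / (feed_g_per_l - target_g_per_l)

TARGET_G_PER_L = 10.0   # assumed meaning of "approximately 1%"
FEED_G_PER_L = 100.0    # assumed carbon content of the 5x concentrate

volume = feed_volume_l(3.0, 8.0, TARGET_G_PER_L, FEED_G_PER_L)
print(f"add {volume * 1000:.0f} ml of 5x feed to a 3 L culture at 8 g/L")
```

Because each addition increases the culture volume, the calculation has to be repeated with the updated volume at every feeding step.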
When the culture reached an OD600 of 60.0, the temperature was increased to 
42°C for 10 min, then lowered to 39 or 40°C for 2-3 hours. The cells were then 
harvested by centrifugation and, if necessary, frozen at -70°C until processed. 

Other fermentors used for the expression of protein polymers were usually a 
15 L MBR, 10 L working volume, or a 13 L Braun Biostat E, 8.5 L working volume. 
The choice of the fermentor and its size is not critical. Any media used for the 
growth of E. coli can be used. The nitrogen source ranged from NZ Amine to 
inorganic salts and the carbon source generally used was glycerol or glucose. All 
fermentations were done with the appropriate selection conditions imposed by the 
plasmid requirements (e.g. kanamycin, ampicillin, etc.). The fermentation method 
used to express protein polymers in E. coli was the fed-batch method. This is the 
preferred method for the fermentation of recombinant organisms even if other 
methods can be used. 

The fed-batch method exploits the stage of cell growth where the organisms 
make a transition from exponential to stationary phase. This transition is often the 
result of either depletion of an essential nutrient or accumulation of a metabolic 
byproduct. When the transition is the result of nutrient depletion, the addition of 
nutrients to the system causes cell division to continue. One or more essential 
nutrients can incrementally be added to the fermentation vessel during the run, with 
the net volume increasing during the fermentation process. The result is a controlled 
growth rate where biomass and expression levels can be optimized. When the cell 
number in the culture has reached or is approaching a maximum, protein polymer 
production is induced by providing an appropriate physical or chemical signal, 
depending upon the expression system used. Production will then continue until the 
accumulated product reaches maximum levels (Fieschko, J., and Ritch, T., Chem. 
Eng. Commun. (1986), 45:229-240; Seo, J.H.; Bailey, J.E., Biotechnol. Bioeng. 
(1986), 28:1590-1594). 



Table 1. Medium Table 

Medium A: LB Medium 

Constituent                   g/L 
NaCl                          10 
tryptone                      10 
yeast extract                 5 
kanamycin                     5x10^-3 

Medium B 

Constituent                   g/L 
NH4Cl                         4.5 
KH2PO4                        0.76 
MgSO4·7H2O                    0.18 
K2SO4                         0.09 
CaCl2                         24x10^-3 
FeSO4·7H2O                    7.6x10^-3 
TE                            0.5 ml 
casamino acids                25 
yeast extract                 5 
glucose                       20 
kanamycin                     5x10^-3 

Medium D 

Constituent                   g/L 
(NH4)2SO4                     5.6 
K2HPO4                        6.7 
MgSO4·7H2O                    7.8 
NaH2PO4·H2O                   3.8 
EDTA                          0.98 
Trace Elements                1 ml 
Yeast Extract or NZ Amine     50 
Glucose or glycerol           20 
Kanamycin or ampicillin       5x10^-3 



Example 2 

Assembly and Expression of the SlpI Gene 

1. Summary of the scheme for assembling the SlpI gene:

An 18 bp DNA sequence that codes for the most frequent repeating 
oligopeptide in the silk fibroin protein made by Bombyx mori [Lucas, F. and K.M. 
Rudall (1986) Extracellular Fibrous Proteins: The Silks, p. 475-558, in 
Comprehensive Biochemistry, vol. 26, part B., M. Florkin and F.H. Stotz (eds.) 
Elsevier, Amsterdam] was synthesized in vitro. Two single-strands were 
synthesized, annealed together and then the resulting double-stranded segments were 
multimerized head-to-tail to generate concatamers of up to and exceeding 13 repeats. 
The structural gene for silk I that we proceeded to work with had 13 repeats that 
coded for the oligopeptide GAGAGS, where G = glycine, A = alanine and 
S = serine. We refer to this structural gene as the "monomer". We constructed 
"dimeric, trimeric, tetrameric, pentameric and hexameric" SlpI genes containing 26 
(SlpI-2), 39 (SlpI-3), 52 (SlpI-4), 65 (SlpI-5) and 78 (SlpI-6) repeats. There is a 
short intervening sequence between each monomer unit. The assembly is pictured as 
follows: 

Repeating DNA Sequence    5'-GGTGCGGGCGCAGGAAGT 
                                CGCCCGCGTCCTTCACCA-5' 

"Monomer" 

Multimers 
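
The two synthetic strands shown above can be checked computationally: the top strand should encode the GAGAGS repeat, and the staggered pairing of the strands should leave complementary single-stranded ends that permit head-to-tail concatenation. The Python sketch below is an illustrative check only; the codon table is limited to the six codons present in the repeat, and the helper names are hypothetical.

```python
# Only the codons that occur in the 18-base repeat (standard genetic code).
CODONS = {"GGT": "G", "GCG": "A", "GGC": "G", "GCA": "A", "GGA": "G", "AGT": "S"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

TOP_5_TO_3 = "GGTGCGGGCGCAGGAAGT"     # top strand, written 5' to 3'
BOTTOM_3_TO_5 = "CGCCCGCGTCCTTCACCA"  # bottom strand, written 3' to 5' as printed

def translate(dna):
    """Translate an in-frame DNA string using the restricted codon table."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))

def stagger(top, bottom_3_to_5):
    """Number of unpaired 5' bases on the top strand when the strands anneal."""
    partner = bottom_3_to_5.translate(COMPLEMENT)  # bases pairing with the bottom strand
    for shift in range(len(top)):
        if top[shift:] == partner[:len(top) - shift]:
            return shift
    return None

shift = stagger(TOP_5_TO_3, BOTTOM_3_TO_5)
print("encoded repeat:", translate(TOP_5_TO_3))
print("single-stranded overhang length:", shift)
print("13-repeat monomer peptide:", translate(TOP_5_TO_3 * 13))
```

The three-base stagger means that each annealed segment carries matching cohesive ends, so segments can only join in the same orientation, which is consistent with the head-to-tail multimerization described in the text.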



2. Assembly of the "monomeric" SlpI structural gene:

The two single-strands shown above were synthesized as previously 
described. The strands were separately purified by gel electrophoresis, 
phosphorylated using T4 polynucleotide kinase and then mixed together and allowed 
to anneal. This resulted in the double-stranded segments aligning spontaneously 
head-to-tail in long concatamers. The phosphodiester bonds between segments were 
formed with T4 DNA ligase. The reaction was stopped by filling in the terminal 
cohesive ends using the Klenow fragment of DNA polymerase I. The blunt-ended 
repeating DNA was then ligated to the HincII REN site in plasmid vector pUC12 
(Vieira, et al., Gene 19:259-268 (1982)). The ligated DNA was transformed into 
E. coli HB101 and transformants were selected for their ability to grow in the 
presence of ampicillin. The DNA of potential clones was analyzed for size and 
orientation by REN digestion and gel electrophoresis. DNA sequences were 
determined for isolates with large inserts that were oriented properly. The 
"monomer" clone selected for subsequent multimerization had 13 repeats coding for 
the oligopeptide AGAGSG, and was named pSY708. The DNA sequence, deduced 
amino acid sequence and REN sites of the SlpI insert and flanking regions of 
pSY708 are shown in Table 2. 

Table 2 

AAGCTTGGGCTGCAGGTCACCCGGGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGT   60 
TTCGAACCCGACGTCCAGTGGGCCCGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCA 
K  L  G  L  Q  V  T  R  A  G  A  G  S  G  A  G  A  G  S  G 

GCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGC   120 
CGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCG 
A  G  A  G  S  G  A  G  A  G  S  G  A  G  A  G  S  G  A  G 

GCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGA   180 
CGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCT 
A  G  S  G  A  G  A  G  S  G  A  G  A  G  S  G  A  G  A  G 

AGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGT   240 
TCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCA 
S  G  A  G  A  G  S  G  A  G  A  G  S  G  A  G  A  G  S  G 

GCGGGCGCAGGAAGTGGGACTCTAGAGGATCCCCGGGCGAGCTCGAATTC             290 
CGCCCGCGTCCTTCACCCTGAGATCTCCTAGGGGCCCGCTCGAGCTTAAG 
A  G  A  G  S  G  T  L  E  D  P  R  A  S  S  N  S 

REN sites marked on the sequence (upper strand, base positions): HindIII 
(AAGCTT, 1-6); PstI (CTGCAG, 10-15); AvaI and SmaI (CCCGGG, 20-25); XbaI 
(TCTAGA, 261-266); BamHI (GGATCC, 267-272); AvaI and SmaI (CCCGGG, 272-277); 
EcoRI (GAATTC, 285-290). 

3. Construction of the expression vector, pSY701:

Plasmid pSP65 (10 µg, Boehringer Mannheim) was digested with AatII 
REN, phenol extracted and ethanol precipitated. The DNA was resuspended in 10 µl 
of H2O. One-half of this DNA was digested with exonuclease III in the following 
mix: 5 µg DNA, 10 µl 10X exonuclease III buffer (600 mM Tris-HCl pH 8.0, 6.6 
mM MgCl2, 10 mM β-mercaptoethanol) and 9 units of exonuclease III in a total 
volume of 200 µl. Samples of 20 µl were taken at 0, 1, 2.5, 5 and 7.5 min and 
diluted immediately in 100 µl of the following buffer (30 mM sodium acetate, pH 
4.5, 0.25 M NaCl, 1 mM ZnSO4) containing 5 µg tRNA and 36 units of S1 
nuclease. Incubation was at 30°C for 45 min and then the reaction was terminated 
by the addition of 15 µl of stop buffer (0.5 M Tris pH 9.0, 125 mM EDTA, 1% 
w/v SDS, 200 µg/ml tRNA). The samples were phenol extracted and ethanol 
precipitated. The resuspended DNA was digested with SmaI REN and 
electrophoresed through a 1% gel of low melting point agarose. The gel band 
corresponding to the DNA fragment carrying the β-lactamase gene, the plasmid 
origin and the β-galactosidase promoter was excised from the gel and melted at 
65°C. One volume of H2O was added. The DNA in each sample (timepoint) was 
recircularized by ligation in the presence of agarose. The reaction included 8 µl 
melted gel, 2 µl of ligation buffer (100 mM Tris-HCl pH 7.5, 50 mM MgCl2, 50 
mM DTT, 1 mM ATP) and 10 units T4 DNA ligase, and was incubated at 15°C for 3 
hours. Competent cells of JM101 were transformed with the ligated DNA and 
transformants were selected by growth on L broth plates containing ampicillin (40 
µg/ml). Plasmid DNA was prepared from four transformants. The DNA was 
digested with BamHI REN, labeled with 32P-dGTP using the Klenow fragment of 
DNA Polymerase I, digested with PvuI and then the smallest fragment was gel 
purified. The fragment from one transformant was sequenced using the Maxam and 
Gilbert technique. The fragments of the other three plasmids were further digested 
with TaqI and electrophoresed on the same gel. The sequenced plasmid had a fusion 
between the multiple cloning site and a position upstream from the N-terminal ATG 
of β-lactamase. The size of the BamHI-TaqI fragment of two of the other plasmids 
indicated a fusion between the multiple cloning site and the 4th amino acid of the 
β-lactamase gene. The DNA and corresponding amino acid sequences of the 
N-terminal region of the altered β-lactamase, along with a circular 
map of REN sites for pSY701, are shown in Fig. 1. The amino acid sequence of 
Fig. 1 is met-thr-met-ile-thr-pro-ser-leu-gly-cys-arg-ser-thr-leu-glu-asp-pro-his-phe- 
arg-val-ala-leu-ile-pro-phe-phe-ala-ala-phe-cys-leu-pro-val-phe-ala-his. 

4. Insertion of "monomer" SlpI from pSY708 into pSY701:

Plasmid pSY708 was digested with HindIII, the cohesive ends were filled in 
using the Klenow fragment of DNA polymerase I and then digested with BamHI. 
Plasmid pSY701 was digested with XbaI, filled in as above and then digested with 
BamHI. The DNA fragment from pSY708 and the backbone of pSY701 were then 
purified by electrophoresis through a low melting temperature agarose gel and 
purified with NACS (BRL) columns. The appropriate fragments were mixed, 
ligated, and then transformed into E. coli JM109. Transformed cells were selected 
by growth on L plates containing ampicillin (40 mg/ml), IPTG (5x10^-4 M) and 
XGAL (20 mg/ml). Transformants were analyzed for plasmid contents and one 
(pSY756) was selected for further study since it carried the insert of the monomer 
SlpI-1 sequences in the proper orientation, as determined by mapping of REN sites. 
Although the entire DNA sequence was not determined for pSY756, the junctions 
between the insert and vector were verified as correct restriction sequences for XbaI, 
upstream, and BamHI, downstream. 

5. Multimerization of the SlpI gene of pSY756:

Plasmid pSY708 was digested with the REN SmaI and the DNA fragment
carrying the coding sequence for the polypeptide
arg(ala-gly-ala-gly-ser-gly)13thr-leu-glu-asp-pro (R(AGAGSG)13TLEDP) was
purified as in 4 above. Plasmid pSY756 was digested with SmaI, deproteinized and
then ligated with the purified DNA fragment from pSY708. Transformants of
E. coli JM109 were selected on medium containing ampicillin. Clones were found
to contain 2 units (dimer pSY882), 3 units (trimer pSY883), and 4 units (tetramer
pSY915) of the original monomer sequence of the pSY708 clone. Similarly,
pentamers and hexamers have also been constructed. All of these plasmids are
genetically stable and produce the gly-ala peptide as a fusion with β-lactamase.
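
As an illustration only, the repeat structure of the multimers can be written out in a short Python sketch. It assumes that each inserted unit contributes the full R(AGAGSG)13TLEDP stretch and it ignores any vector-derived residues at the junctions; the names MONOMER and slp1_multimer are hypothetical and do not come from the specification.

```python
from collections import Counter

# One "monomer" unit as given in the text: R(AGAGSG)13TLEDP
MONOMER = "R" + "AGAGSG" * 13 + "TLEDP"


def slp1_multimer(units: int) -> str:
    """Concatenate `units` copies of the monomer (junction residues ignored)."""
    return MONOMER * units


for name, n in [("dimer", 2), ("trimer", 3), ("tetramer", 4)]:
    seq = slp1_multimer(n)
    counts = Counter(seq)
    # Fraction of residues that are glycine, alanine or serine
    gas = (counts["G"] + counts["A"] + counts["S"]) / len(seq)
    print(f"{name}: {len(seq)} residues, gly+ala+ser fraction = {gas:.3f}")
```

Under these assumptions every multimer has the same gly+ala+ser fraction, since each unit is identical; only the chain length changes.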

6. Expression of the SlpI gene fusion to the β-lactamase protein:

Synthesis in E. coli cells of the SlpI peptide as a fusion protein with
β-lactamase was detected by immunoblotting (Western) analysis. Anti-"Slp"
antibodies were raised against a synthetic silk peptide. Fusions between β-lactamase
and SlpI were also detected with antibodies raised against the E. coli β-lactamase.
As shown in Figure 2, this antibody reacts with dimers and trimers of SlpI fused to
the E. coli β-lactamase. The SlpI insert precedes the fifth amino acid of the signal
sequence for this enzyme. The β-lactamase antibody (Figure 2A) detects both the
unprocessed fusion proteins and the processed mature enzyme, which appears
as the major antigenic band in this figure, at about the 28 kDal position. The
mobilities of all Slp-containing polypeptides are anomalously slow and the proteins
are not as large as they appear on the gels.

The anti-Slp antibody also is useful in detecting these fusion products. Lanes
2-5 of Figure 2B represent 4 separate clones that contain dimer fusions of SlpI with
β-lactamase, while lanes 6 and 7 are from two clones containing trimer fusions. As
can be seen, the antigenicity of the trimer is considerably greater than that of the dimer.
It is known from prior experiments that fusion proteins containing only a monomer
of SlpI are not detected at all with this anti-Slp antibody. The increased antigenicity
of the trimer peptide allows it to be detected as a processed fusion with the
β-lactamase signal peptide. The processed form is seen at about the 33 kDal position
in lanes 6 and 7 of Figure 2B. The appearance of normally processed β-lactamase
mature enzyme (detected with β-lactamase antibody) as well as a peptide
corresponding to the fusion between the SlpI-3 trimer and the signal peptide of
β-lactamase (detected with gly-ala antibody) suggests that despite the insertion of SlpI
sequences within the signal sequence, normal proteolytic processing of the enzyme
occurs in E. coli.

7.a. Expression of the SlpI gene by fusion to T7 genes:

The SlpI sequence has also been expressed as a fusion protein with both the
gene 9 and gene 10 proteins from bacteriophage T7 in E. coli. The construction is
diagrammed in Figure 3. Plasmid pSY915 (containing the SlpI-4 tetramer) was
digested to completion with REN SalI and partially with BamHI. The DNA
fragment containing the SlpI-4 tetramer was purified and then cloned in plasmid
pSY114 (pG2 of Promega Biotech) which had been digested with RENs SalI and
BamHI. From this intermediate plasmid, the tetramer insert of SlpI was removed
with the RENs AccI and EcoRI. This fragment was then cloned in pSY633
(pBR322 containing the complete T7 gene 9 sequence; pAR441 of Studier et al.,
(1986)) which was digested with EcoRI and AsuII. In the resulting plasmid the SlpI
tetramer is fused to the gene 9 translational reading frame near the C-terminus of
gene 9. This plasmid was then used to transform E. coli strain 0-48 (strain HMS174
(λDE3) of Studier, et al., 1986) which contains the T7 RNA polymerase gene
inserted into the chromosome under transcriptional control of the IPTG-inducible
β-galactosidase promoter. In this configuration, expression of the SlpI-4 sequence is
dependent upon production of the T7 RNA polymerase, which itself is controlled by
the IPTG-inducible β-galactosidase promoter. As shown in Figures 4B and 4C, when
these cells are induced with IPTG a protein product of the gene 9/SlpI-4 fusion gene
is synthesized and is detected with antibody to the synthetic Slp peptide. The fusion
product migrates in the gel as if it were 82 kDal in size. The size expected is only 65
kDal. The anomalous mobility is characteristic of the unusual amino acid
composition (rich in glycine and alanine) and is seen for all Slp-containing products.
In like manner, plasmid pSY638 (pAR2113 of Studier) containing the
promoter region and the first 13 amino acids of the T7 gene 10 protein, was digested
with REN BamHI, filled in with the Klenow fragment of DNA polymerase and then
digested with REN EcoRI. Into this linearized plasmid was cloned the AsuII-EcoRI
fragment of pSY633, containing the SlpI-4 tetramer. This ligation creates an
in-frame fusion of the silk tetramer following the thirteenth amino acid of T7 gene 10.
The latter fusion product may be used for spinning without further processing since
the N-terminal 13 amino acids are only a small part of the large SlpI protein.
Although the fusion product is about 30 kDal in size, it has an anomalous mobility
and migrates as if it were larger, 50 kDal. This is shown in Figure 4A.
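
The two size comparisons quoted for the T7 fusions can be put side by side with a few lines of Python. This is only a worked arithmetic sketch using the apparent and expected sizes stated in the text; the dictionary keys and the function name mobility_ratio are hypothetical labels.

```python
# (apparent size on the gel, expected size), both in kDal, as stated in the text
FUSIONS = {
    "gene 9/SlpI-4": (82, 65),
    "gene 10/SlpI-4": (50, 30),
}


def mobility_ratio(apparent_kdal: float, expected_kdal: float) -> float:
    """Ratio of apparent to expected size; 1.0 would mean normal migration."""
    return apparent_kdal / expected_kdal


for name, (apparent, expected) in FUSIONS.items():
    print(f"{name}: apparent/expected = {mobility_ratio(apparent, expected):.2f}")
```

The ratio is larger for the gene 10 fusion, in which the Slp portion makes up nearly all of the protein. That is at least consistent with the text's attribution of the slow migration to the glycine- and alanine-rich composition, although the calculation itself proves nothing about the cause.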
The plasmids pG9/SlpI-4 and pG10/SlpI-4 were further improved by
inserting a kanamycin-resistance gene in the β-lactamase gene in the orientation
opposite to the T7 expression system. Thus, any low level expression from the T7
system does not lead to elevated β-lactamase activity. Such activity had eliminated the
ampicillin in the medium that was added to select for maintenance of the plasmid.
When the ampicillin was depleted the plasmids were lost from the culture. The
kanamycin-resistance gene circumvents this problem and represents a significant
improvement in the T7 expression system, especially for large scale cultures. The
kanamycin-resistance gene (originally from Tn903) was isolated from a plasmid
pUC4K (Vieira, J. and J. Messing (1982) Gene 19:259-268) as a HincII fragment.
The plasmid containing pG10/SlpI-4 and the kanamycin-resistance gene was
designated pSY997.

7.b. Fermentation and purification of SlpI-4:

E. coli strain 0-48 carrying pSY997 was grown at 37°C, using a Chemap or
a Braun fermentor, in 10 L of LB to an OD (Klett units) of 300 (3 x 10^9 cells/ml).
The T7 system was then induced with the addition of 3.5 mM IPTG. After 150 min
the cells were concentrated 10x using a Millipore filter unit (Pellicon cassette
system, 100,000 molecular weight cut off filter). The cell suspension was then
frozen at -70°C until processing.

The cell suspension was thawed in a water bath at 42°C and lysed in a French
press, and the lysate was spun at 125,000xg for 1 hour at 25°C. The cleared
supernatant was treated with DNAase (250 μg/ml) for 15 min at room temperature,
then filtered through a 0.45 μm sterile filter. The filtrate volume was measured and
the filtrate was incubated on ice with slow stirring. Then 231 mg of ammonium
sulphate were added for each ml of filtrate over a period of 45 minutes. One ml of
NaOH for each 10 g of ammonium sulphate was added to neutralize the pH.

After 2 hours of continuous stirring the mixture was spun at 9,000xg for 10
minutes. The pellet was resuspended in 1/10 of the original filtrate volume using
distilled water. The centrifugation and resuspension were repeated three times. The
pellet was resuspended in 1/10 of the original filtrate volume in distilled water.
Samples were analyzed for protein concentration, amino acid composition and
protein sequence by standard methods. This is one of several methods for obtaining
the product. This method results in a SlpI-4 product that is greater than 90% pure.
The amino acid composition is almost entirely gly, ala and ser, as expected, and the
N-terminal amino acid sequence is that of the gene 10 leader.
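
The ammonium sulphate step scales with the measured filtrate volume, so the quantities can be tabulated with a small helper. The sketch below is a convenience calculation only: it uses the ratios stated above (231 mg of salt per ml, 1 ml of NaOH per 10 g of salt, resuspension in 1/10 of the filtrate volume), the function name precipitation_plan is hypothetical, the 500 ml volume is an arbitrary example, and the NaOH concentration is not given in the text.

```python
AMMONIUM_SULPHATE_MG_PER_ML = 231.0  # stated in the text
NAOH_ML_PER_10_G_SALT = 1.0          # stated in the text; NaOH molarity is not given


def precipitation_plan(filtrate_ml: float) -> dict:
    """Return reagent amounts for a given filtrate volume in ml."""
    salt_g = AMMONIUM_SULPHATE_MG_PER_ML * filtrate_ml / 1000.0
    return {
        "ammonium_sulphate_g": salt_g,
        "naoh_ml": NAOH_ML_PER_10_G_SALT * salt_g / 10.0,
        "resuspension_ml": filtrate_ml / 10.0,
    }


# Hypothetical example: 500 ml of filtrate
plan = precipitation_plan(500.0)
for key, value in plan.items():
    print(f"{key}: {value:.2f}")
```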

8. Controlled expression of the T7 RNA polymerase gene in Bacillus subtilis:

The coding sequence of the T7 RNA polymerase gene (T7 gene 1, T7
nucleotides 3128 to 5845) from plasmid pSY558 (pAR1151 of Studier, et al., 1986)
was modified by in vitro mutagenesis of cloned DNA. We inserted the recognition
sequence for the restriction endonuclease NdeI at position 3171. Using an
oligodeoxynucleotide which was synthesized as previously described, the T7 gene 1
sequence was changed from its natural sequence, TAAATG, to the modified
sequence, CATATG.
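
The change from TAAATG to CATATG can be checked base by base. The following Python sketch compares the two hexamers given in the text and confirms that the modified sequence contains the NdeI recognition sequence (CATATG) while keeping the ATG start codon; the helper name substitutions is hypothetical.

```python
NATIVE = "TAAATG"     # natural T7 gene 1 sequence, from the text
MODIFIED = "CATATG"   # sequence after mutagenesis, from the text
NDEI_SITE = "CATATG"  # NdeI recognition sequence


def substitutions(before: str, after: str) -> list:
    """List (position, old base, new base) for every base that differs."""
    if len(before) != len(after):
        raise ValueError("sequences must have the same length")
    return [(i, a, b) for i, (a, b) in enumerate(zip(before, after)) if a != b]


changes = substitutions(NATIVE, MODIFIED)
print("base changes:", changes)
print("NdeI site present after mutagenesis:", NDEI_SITE in MODIFIED)
print("ATG start codon retained:", MODIFIED.endswith("ATG"))
```

Only two bases differ, both upstream of the ATG, so the site is created without altering the start codon.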

Similarly, the upstream regulatory sequence of the Bacillus subtilis gene
spoVG, obtained from plasmid pCB1291 (Rosenblum, et al., J. Bacteriology,
148:341-351 (1981)), was modified by in vitro mutagenesis at position 85 (Johnson,
et al., Nature, 302:800-804 (1983)) such that it also includes an NdeI cleavage site.
The upstream regulatory sequences of the spoVG gene were then ligated with the
coding sequence of the T7 RNA polymerase gene via these novel NdeI cleavage
sites. After transformation of E. coli HB101, the plasmid contents of individual
ampicillin-resistant isolates were checked by restriction mapping. The correct
construction was named pSY649.
Plasmid DNA containing the spoVG:T7 RNA polymerase fusion gene
(pSY649) was further modified to include a chloramphenicol-resistance gene that
functions in B. subtilis. First the NdeI to SalI fragment of about 1200 base pairs
from plasmid pGR71-P43 (Goldfarb, et al., Nature, 293:309-311 (1981)) was
isolated. This fragment carries the P43 promoter of B. subtilis and an adjacent
chloramphenicol acetyltransferase gene from Tn9. After filling in all the cohesive
ends using the Klenow DNA polymerase reaction, this fragment was inserted into
the XbaI site within the multiple-cloning site of pUC13 (Vieira, et al., Gene,
19:259-268 (1982)). Ampicillin- and chloramphenicol-resistant transformants were
selected for further use. The correct plasmid construction was named pSY630. The
SmaI to HincII endonuclease cleavage fragment from plasmid pSY630 containing
the chloramphenicol acetyltransferase gene fused to the P43 promoter sequence was
gel purified and blunt-end ligated to the PvuI site of plasmid pSY649 that had been
treated first with T4 DNA polymerase. The resulting plasmid, pSY856, was then
transformed into B. subtilis 1168. Because plasmid pSY856 is unable to replicate
autonomously in B. subtilis, stable transformants resistant to chloramphenicol must
result from the integration of the plasmid into the B. subtilis chromosome (Ferrari,
et al., J. Bacteriology, 154:1513-1515 (1983)). The integration event, facilitated by
homologous recombination, most likely occurred at either the spoVG or the P43 locus
of the bacterial chromosome (pSY856 contains DNA sequences homologous to the
B. subtilis chromosome at only these two sites). The resulting strain, "BIPoL," was
grown both in the presence and absence of chloramphenicol in order to determine
the stability of the selectable marker. Expression of the T7 polymerase was obtained
and this had no apparent effect on the growth or viability of this strain.

9.a. Expression of a plasmid-borne target gene (kanamycin-resistance) in B.
subtilis strain BIPoL:

The Staphylococcus aureus plasmid pUB110 (Lacey, et al., J. Med.
Microbiology, 7:285-297, 1974) which contains the gene coding for resistance to the
antibiotic kanamycin was used to test the expression of the growth-regulated
spoVG:T7 RNA polymerase gene of strain BIPoL. An EcoRI-BamHI fragment of
phage T7 DNA (positions 21,402 to 22,858) containing the T7 gene 9 promoter
sequence was purified from plasmid pAR441 (Studier, et al., 1986). This DNA
fragment was ligated into pUB110 between the EcoRI and BamHI restriction
endonuclease sites. The resulting plasmid, pSY952, contains the T7-specific
promoter in the same orientation as the kanamycin-resistance gene. Plasmid
pSY952 was transformed into B. subtilis 1168 and BIPoL and these strains were
analyzed for the level of expression of the polypeptide encoded by the
plasmid-derived kanamycin-resistance gene. Approximately 10^9 cells from growing
cultures of 1168, 1168 containing pUB110, 1168 containing pSY952, BIPoL, BIPoL
containing pUB110, and BIPoL containing pSY952 were obtained at several times
during the growth and sporulation cycle. The proteins in these cell samples were
processed and analyzed by polyacrylamide gel electrophoresis.

Because the rate of transcription from the spoVG promoter increases as a
function of cell density and reaches a maximum during early sporulation, an
accelerated accumulation of the target protein is expected in the BIPoL strain
containing pSY952 during growth as the culture enters sporulation. The results
show that a protein of molecular weight 34 kDal increases in abundance as the
culture approaches and enters stationary phase. The size of the protein is in
agreement with the predicted size of the kanamycin-resistance gene product (Sadaie,
et al., J. Bacteriology, 141:1178-1182 (1980)) encoded in pSY952. This protein is
not present in BIPoL, in 1168 containing pSY952, which lacks the spoVG-regulated
T7 RNA polymerase gene, or in BIPoL containing pUB110, which lacks the T7
promoter sequence. The maximum accumulated level of target protein after 24
hours of growth in BIPoL containing pSY952 was 20% of the total cellular protein
as determined by densitometry.

9.b. Expression of SlpI-4 in B. subtilis:

Plasmid pG10SlpI was digested with EcoRI REN. After filling in the
cohesive ends using the Klenow DNA polymerase reaction, the DNA was digested
with BglII REN. Plasmid pSY662 was digested with SmaI and BamHI RENs. The
two plasmids were then purified by electrophoresis through a low melting
temperature agarose gel and purified with NACS (BRL) columns. The DNA
fragment of pG10SlpI was ligated to the backbone of pSY662 and transformed into
E. coli, with selection on medium containing ampicillin (40 μg/ml). Transformants
were analyzed for plasmid contents and one (pSY662/G10/SlpI-4) was selected for
further study.

Competent cells of B. subtilis BIPoL were transformed with
pSY662/G10/SlpI-4 and incubated at 37°C with shaking for 90 minutes. The
transformation mixture was then diluted 1:100 in fresh LB containing 10 μg/ml of
tetracycline and incubated at 37°C with shaking. Samples were taken and equal
numbers of cells were lysed and loaded on gels for separation by SDS-PAGE.
Immunoblot analysis was performed using anti-Slp antibodies to detect the synthesis
of the gene 10/SlpI-4 fusion protein.

The expression of the SlpI-4 polypeptide in B. subtilis was detected by its
seroreactivity with anti-Slp antibody, after transfer of the cellular proteins from the
polyacrylamide gel to a nitrocellulose filter. We verified that the seroreactive
protein was the product of the SlpI-4 gene by exhaustively treating the cellular
proteins with CNBr. This should cleave after methionine residues, but since SlpI-4
lacks methionine it will remain intact. The CNBr treatment eliminated greater than
98% of the proteins stainable with Coomassie blue dye. As expected for a
protein lacking methionine, SlpI-4 remained intact and still reacted with anti-Slp
serum.
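
The logic of the CNBr test can be illustrated with a simplified model that splits a one-letter protein sequence after every methionine. This is a sketch rather than a description of the chemistry (in practice the C-terminal methionine of each fragment is converted to another residue), the function name cnbr_fragments is hypothetical, and both example sequences are invented for illustration.

```python
from typing import List


def cnbr_fragments(protein: str) -> List[str]:
    """Split a one-letter protein sequence after every methionine (M)."""
    fragments = []
    current = ""
    for residue in protein:
        current += residue
        if residue == "M":
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments


slp_like = "AGAGSG" * 13        # methionine-free repeat block
typical = "MKTLMAAGHMSTV"       # hypothetical methionine-containing protein

print(len(cnbr_fragments(slp_like)))  # a single intact fragment
print(cnbr_fragments(typical))
```

A methionine-free sequence comes back as one piece, while a protein with internal methionines is broken into several smaller fragments, which is the distinction the experiment relies on.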
Example 3

Assembly and Expression of the SlpIII Gene

1. Summary of the scheme for assembling the SlpIII gene:

The synthetic SlpIII gene codes for a protein similar to that encoded by the SlpI
gene and to the crystalline region of the silk fibroin protein made by the silkworm,
Bombyx mori. SlpIII more closely resembles the silk fibroin molecule because it
includes the amino acid tyrosine at regular intervals (about 50 residues), whereas
multimers of SlpI do not. The SlpIII gene was assembled from smaller parts. First,
three double-stranded sections of DNA of about 60 bp in length were chemically
synthesized. Each section was cloned by insertion into bacteriophage M13 and the
DNA sequence was verified. These sections were then removed from the vector and
linked together in a specific order. This linkage of about 180 bp is named the SlpIII
"monomer". "Monomers" were then linked in a specific order to yield dimers,
trimers, tetramers, etc. of SlpIII. The multimers were then cloned either directly
into plasmid expression vectors to detect the SlpIII protein or initially into an adapter
plasmid. Insertion of the SlpIII DNA into the adapter allows for further gene
manipulation and is further described later. The assembly scheme is pictured as
follows:



[Scheme: Section 1, Section 2 and Section 3, each drawn 5' to 3' as
double-stranded DNA; assembly of "monomer" (1-2-3); multimerization
(123-123-123)]

2. Synthesis of double-stranded DNA sections:

The DNA and corresponding amino acid sequences of the three sections of
the SlpIII gene are shown in Table 3.

The double-stranded DNA sequence is shown in the 5' to 3' direction. The
amino acids (g = glycine, a = alanine, s = serine, y = tyrosine) coded by the
sequence are shown immediately below each section. Recognition sequences for
cleavage by restriction endonucleases are shown above each section.

The above six single strands were synthesized. After synthesis, the strands
of DNA were purified and the homologous strands were annealed. About 1 μl (0.5
μg) of each strand was mixed with 2 μl of 10X AA (description) buffer and 16 μl of
sterilized deionized H2O in a 1.5 ml polypropylene Eppendorf tube. The tube was
placed in a boiling water bath (500 ml in a 1 liter beaker) for 10 min and then the
beaker was removed from the hot plate and allowed to cool on the bench to room
temperature. This required about 1-2 hours.

Each of the three double-stranded sections was cloned separately into
M13mp18. Section 1 was ligated between the SmaI and BamHI restriction sites of
the multiple-cloning site. Section 2 was ligated between the BamHI and PstI sites.
Section 3 was inserted between the PstI and HindIII sites. The respective clones
are: M13mp18.1, M13mp18.2, M13mp18.3. The DNA sequence was determined
for each cloned section. One representative of each section that had the correct
DNA sequence was recovered and became the material for the next step: assembly
of the "monomer".

3. Assembly of the "monomer" of SlpIII:

The DNA sections 2 and 3 were isolated by digestion of the M13 clones with
restriction enzymes: for section 2, M13mp18.2 was digested with BamHI and PstI;
for section 3, M13mp18.3 was digested with PstI and HindIII. The two sections
were purified and mixed together in equal molar amounts with M13mp18.1 that had
been first digested with BamHI and HindIII. T4 DNA ligase was added to link the
homologous overlapping ends in the order 1-2-3. Due to the hybridization
specificity of the cohesive ends, the three sections are efficiently linked in only this
order. The DNA sequence of the cloned "monomer" in the assembly named
M13mp18.1.2.3 was determined to be correct and as shown in 2 above.
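
Why the cohesive ends permit only the order 1-2-3 can be shown with a toy model in which each fragment is described solely by the enzymes that produced its two ends, and two ends join only if they were cut by the same enzyme. This is a simplified sketch: it ignores fragment orientation and blunt ends, and the names INSERTS and assembly_order are hypothetical.

```python
# Each insert is described by the enzymes that generated its left and right ends.
INSERTS = {
    "section 2": ("BamHI", "PstI"),
    "section 3": ("PstI", "HindIII"),
}


def assembly_order(open_end: str, close_end: str, inserts: dict) -> list:
    """Chain inserts from the vector's open end to its closing end.

    Raises ValueError if no unique chain of matching ends exists.
    """
    order = []
    end = open_end
    remaining = dict(inserts)
    while end != close_end:
        matches = [name for name, (left, _) in remaining.items() if left == end]
        if len(matches) != 1:
            raise ValueError(f"no unique fragment fits the {end} end")
        name = matches[0]
        order.append(name)
        end = remaining.pop(name)[1]
    return order


# M13mp18.1 already carries section 1 and is opened with BamHI and HindIII.
print(["section 1"] + assembly_order("BamHI", "HindIII", INSERTS))
```

In this model section 3 cannot join the BamHI end of the vector and section 2 cannot close onto the HindIII end, so the only chain that closes the vector is section 2 followed by section 3, downstream of section 1.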
4. Multimerization of the "monomer" of SlpIII:

In order to prepare large amounts of the "monomer" structural gene we first
subcloned the "monomer" into the plasmid vector pUC12. M13mp18.1.2.3 was
digested with EcoRI and HindIII restriction enzymes. The SlpIII "monomer" was
gel purified and ligated into pUC12 digested with EcoRI and HindIII. The resulting
plasmid DNA was prepared, the "monomer" was released from the vector by
digestion with BanI REN and the fragment was gel purified.

To create multimers, "monomer" DNA fragments with BanI ends were linked by
ligation. The nonpalindromic terminal BanI recognition sequence allows linkage
only in a head-to-tail order. The extent of multimerization is monitored by gel
electrophoresis and staining the DNA with ethidium bromide. Multimers of more
than 20 units have been obtained by this method.
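
The role of a nonpalindromic end can be made concrete with a short Python check. The sketch below relies on general enzyme reference data that is not stated in this text, namely that BanI recognizes GGYRCC (Y = C or T, R = A or G) and cuts after the first G to leave a four-base 5' overhang GYRC. It enumerates the four possible overhangs and tests which ones equal their own reverse complement; which variant the SlpIII "monomer" carries is not given in this section, and the function names are hypothetical.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def is_palindromic(seq: str) -> bool:
    """True if the sequence equals its own reverse complement."""
    return seq == reverse_complement(seq)


# Assumed from general references: BanI overhang is G-Y-R-C (Y = C/T, R = A/G).
overhangs = ["G" + y + r + "C" for y, r in product("CT", "AG")]

for overhang in overhangs:
    kind = "palindromic" if is_palindromic(overhang) else "nonpalindromic"
    print(overhang, kind)
```

A palindromic overhang can pair with an identical copy of itself, so fragments could join in either orientation. A nonpalindromic overhang pairs only with its complement at the opposite end of another fragment, which is what restricts ligation to the head-to-tail order.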

5. Cloning of the multimers of SlpIII:

Plasmid pCQV2 (Queen, et al., J. Appl. Mol. Gen., 2:1-10 (1983)) was
digested with EcoRI and BamHI restriction endonucleases and a fragment of about
900 bp was purified. This DNA fragment contains the bacteriophage lambda cI-857
repressor gene, the closely linked rightward promoter, PR, and the beginning of the
cro gene. Plasmid pSY335 (described as pIF751 in Ferrari, et al., J. Bacteriology,
161:556-562 (1985)) was digested with EcoRI and BamHI restriction enzymes and
subsequently ligated to the DNA fragment of approximately 900 bp of pCQV2. The
plasmid obtained from this construction, pSY751, expresses the β-galactosidase
gene at 37°C and 42°C, but not at 30°C (Figure 8).

In this approach the SlpIII gene is first cloned into an "adapter" sequence in
an intermediate plasmid and then subcloned to the expression systems. The adapter
sequence has the following useful features: a unique central BanI REN site, three
unique REN sites to either side of BanI, information coding for protein cleavage at
either methionine, aspartate-proline or arginine amino acids, and small size. The
BanI site is the point of insertion for the SlpIII multimers with BanI ends.

The adapter was synthesized with the Applied Biosystems 380A Synthesizer,
cloned in M13mp18 and the DNA sequence verified. The adapter was then
subcloned into a specially constructed plasmid vector that lacked BanI REN sites.
The recipient plasmid was made as follows. Plasmid pJH101 (Ferrari, et al., 1983)
was partially digested with AhaIII restriction enzyme and religated. Transformants
of E. coli HB101 were selected on medium containing chloramphenicol (12.5
μg/ml). After restriction analysis of several isolates one plasmid was chosen,
pSY325 (Figure 7). This plasmid contains only the chloramphenicol-resistance gene
and the replication origin (from pBR322) of pJH101. After digestion to completion
with XhoII, pSY325 was ligated with the gel-purified adapter. The result was the
adapter-plasmid, pSY937. The new pSY937 REN sites were verified.

The SlpIII multimers were cloned into the BanI site of pSY937 (Figure 7).
Positive clones were identified by colony hybridization with the lower strand of
section 1 of SlpIII as the DNA probe (probe sequence shown in
Table 2). Positive clones were characterized by gel electrophoresis for the size of
the inserted multimer. Finally, the SlpIII sequences were subcloned using the REN
sites in the flanking adapter regions to specific locations of expression plasmids.

The SlpIII protein had the following amino acid composition:

SlpIII     1178 AA     MW 83,000

(fm) DPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
GAGS (GAGAGS)6 GAAGY
[(GAGAGS)9 GAAGY]18
GAGAGSGAGAGSGAGAMDPGRYQLSAGRYHYQLVWCQK

(fm) intends the initiation codon
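
The stated length of 1178 amino acids offers a check on how the repeat formula is read. The Python sketch below expands the formula under explicit assumptions: the leader is taken to begin DPVVLQ, the repeat subscripts are taken as 6 and 18, and the initiating residue (fm) is counted as a single M. These readings are assumptions chosen because they reproduce the stated total, and the names LEADER, TAIL and slp3 are hypothetical.

```python
LEADER = "DPVVLQRRDWENPGVTQLNRLAAHPPFASDPM"         # assumed reading of the leader
TAIL = "GAGAGSGAGAGSGAGAMDPGRYQLSAGRYHYQLVWCQK"


def slp3(first_repeats: int = 6, blocks: int = 18) -> str:
    """Expand (fm) LEADER GAGS (GAGAGS)n GAAGY [(GAGAGS)9 GAAGY]m TAIL."""
    return (
        "M"                                   # (fm), counted as one residue
        + LEADER
        + "GAGS"
        + "GAGAGS" * first_repeats
        + "GAAGY"
        + ("GAGAGS" * 9 + "GAAGY") * blocks
        + TAIL
    )


seq = slp3()
print("length:", len(seq))
print("GAAGY blocks:", seq.count("GAAGY"))
```

With these readings the expanded sequence has exactly 1178 residues and contains 19 tyrosine-bearing GAAGY blocks, one every 59 residues within the repeated region.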

SlpIII Expression Vector

Plasmid DNA pSY1086 is a pSY937 derivative containing 19 repeats of
SlpIII (3.5 kb). This plasmid DNA was digested with NruI and PvuII and the
fragments separated by agarose gel electrophoresis. The purified SlpIII multimer
was then cloned in plasmid pSY751 digested with PvuII REN. Several clones were
analyzed and one (pSY1008) was chosen to be used in expression experiments and
SlpIII purification.

The ampicillin drug resistance gene of pSY1008 was substituted with the
kanamycin marker from pSY1010 (produced by digestion of pSY633 with DraI and
SspI and insertion of KanR obtained by HincII digestion of pUC4K) and the
subsequent plasmid was called pSY1186. By removing the SlpIII portion of plasmid
pSY1186 with BanI, a new plasmid, pSY1262, was generated. This plasmid
contains a unique BanI site which allows for the direct ligation of fragments
containing BanI ends obtained by polymerization of monomers. This plasmid has
been used to generate plasmids containing inserts for the following proteins: SELP1,
2, 3, and Slp4.



Production and Purification of SlpIII

Cell Culture

E. coli are cultured in the following medium:

Medium C
                        g/l
yeast extract           20
casamino acids          20
peptone                 20
gelatin peptone         20
KH2PO4                  2
K2HPO4                  2
Na2HPO4.7H2O            2
glucose                 2
ampicillin              0.1
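
For batch preparation, the per-liter amounts of Medium C can be scaled to the working volume. The sketch below is a convenience calculation only; it uses the g/l values listed for Medium C and the 375 l fill volume quoted for the 500 l fermentor, and the names MEDIUM_C_G_PER_L and batch_grams are hypothetical.

```python
# Medium C components in grams per liter, as listed in the table
MEDIUM_C_G_PER_L = {
    "yeast extract": 20,
    "casamino acids": 20,
    "peptone": 20,
    "gelatin peptone": 20,
    "KH2PO4": 2,
    "K2HPO4": 2,
    "Na2HPO4.7H2O": 2,
    "glucose": 2,
    "ampicillin": 0.1,
}


def batch_grams(volume_l: float) -> dict:
    """Scale each component to the requested volume in liters."""
    return {name: g_per_l * volume_l for name, g_per_l in MEDIUM_C_G_PER_L.items()}


batch = batch_grams(375)  # 375 l working volume from the text
for name, grams in batch.items():
    print(f"{name}: {grams:.1f} g")
```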



An overnight culture (500 ml - 1 l) which had been grown at 30°C was used
to inoculate 375 l of media contained in a 500 l fermentor. Fermentor conditions
include a tachometer reading of 100 rpm, vessel back pressure of 5 psi and an air
flow of 170 l/min in order to maintain dissolved O2 at greater than 50%.

Glucose (1 g/l) and ampicillin (0.05 g/l) were added to the fermentation
when the culture reached an OD of 1.0 and again at 2.0. When the culture
reached an OD of 2.0 the temperature was increased to 42°C for 10 minutes and
then lowered to 38°C for 2 hours. The culture was then chilled to 10°C and cells
were harvested by centrifugation in a continuous centrifuge and frozen at -70°C until
processed. Yields from two separate fermentations were 7.3 kg and 5.2 kg wet
weight of cells.

It should be noted that other media can be used and, with different plasmids,
various selection conditions can be imposed (e.g., substitution of kanamycin
selection for ampicillin). These conditions have been used in laboratory scale
fermentations (10 l volumes).

Cell Lysis

Method 1. Cells were thawed and suspended to a concentration of 1 kg wet
weight/6 l in 50 mM Tris-HCl pH 7.0, 1 mM EDTA and broken by 2 passages
through an APR Gaulin cell disrupter at 8000 psi. During this lysis procedure the
cells were kept cold with an ice bath. The cell lysate was then centrifuged at
26,000xg with a continuous centrifuge, such as the T2-28 rotor in a Sorvall RC5B
refrigerated centrifuge operated at 4°C. Under these conditions greater than 90% of
the SlpIII produced can be found in the pellet. The supernatant did contain some
product which could be recovered by (NH4)2SO4 precipitation as described below. The
pellet was extracted with LiBr as described below.

Method 2. Frozen cells were thawed and resuspended to a concentration of
1 kg wet weight in 6 l in 50 mM Tris-HCl pH 7.0, 10 mM EDTA, and 5 mM
PMSF to inhibit protease activity. Cells were stirred in this buffer at room
temperature for 0.5 to 2 hours, then lysozyme was added to a concentration of 1 g/l
and incubation was continued for 20 minutes. β-Mercaptoethanol was then added to
70 mM and the detergent NP40 was then added to a final concentration of 1% for
20 min while continuously stirring the cell suspension. Then MgCl2 was added to
50 mM followed by DNAse at a concentration of 1 mg/l and incubation was
continued at room temperature for 20 minutes. The cell lysate was then centrifuged
as in method 1 at 26,000xg in a continuous centrifuge and the supernatant was
collected and passed through the continuous centrifuge a second time at 26,000xg.
The supernatant resulting from this second centrifugation contained < 5% of the
total SlpIII, but what was there could be recovered with (NH4)2SO4 as described below.
The pellets resulting from the 1st and 2nd 26,000xg centrifugations were combined
and extracted with LiBr as described below.

Method 3. For this method, a strain of E. coli was used that contains a
second plasmid which encodes the T7 phage lysozyme. This plasmid is compatible
with the plasmid encoding the SlpIII gene and the drug resistance determinant. The
strain was grown in the same medium and under the same conditions as in the first
two methods. However, due to the production of the T7 lysozyme inside the cells,
their cell wall was weakened and they could be easily lysed at the completion of the
fermentation by the addition of EDTA to > 100 mM and NP40 to a concentration
of from 0.5 to 1.0% v/v. Lysis could also be achieved by the addition of chloroform
(20 ml per liter of fermentation broth) instead of NP40. Alternatively, cells could
be collected by centrifugation prior to lysis, resuspended to 1 kg wet weight in 6 l
in Tris-EDTA as described in the first two methods and then lysed by the addition of
NP40 or chloroform. Following cell lysis by either method the lysate was
centrifuged in a continuous rotor at 26,000xg as described in the first two methods.
As with those methods, LiBr extraction of the pellet and (NH4)2SO4 precipitation of the
supernatant were used to recover the product.

Purification of SlpIII

The pellet obtained by centrifugation of the cell lysate at 26,000xg as
described above was extracted with an equal volume of 9 M LiBr. The salt solution
was added and the pellet was evenly suspended by stirring at room temperature
(RT). The mixture was stirred for 1 hour at RT after an even suspension was
obtained. The mixture was then centrifuged at 26,000xg in a continuous rotor at
4°C or at RT to generate a pellet and a supernatant fraction. The supernatant was
saved and the pellet was re-extracted with another equal volume of 9 M LiBr as
above. After mixing for 1 hour the mixture was centrifuged at 26,000xg and the
supernatant from this centrifugation was combined with the supernatant from the
first LiBr extraction and allowed to stand at 4°C overnight. Approximately 90% of
the SlpIII contained in the cell lysate 26,000xg pellet was extracted by LiBr using
this procedure.

After the LiBr extract stood overnight at 4°C, a precipitate formed, which was
removed by centrifugation at 26,000xg and discarded. The supernatant was
then placed in dialysis bags and dialyzed against several changes of dH2O for 2 days.
As the LiBr was removed by dialysis the SlpIII product precipitated in the dialysis
bags. The precipitate was collected by centrifugation and washed 2-3 times with
dH2O. The final washed product was centrifuged and dried by lyophilization.

For the recovery of SlpIII from the 26,000xg supernatant fractions, (NH4)2SO4
precipitation was used. Solid (NH4)2SO4 was slowly added to the sample, which was
maintained at 4°C, until 38% saturation was achieved (231 g/l). The mixture was
then stirred at 4°C for 2-3 hours. The precipitate was recovered by centrifugation in
a continuous flow centrifuge and washed 4-5 times with an equal volume of distilled
H2O or with 0.5% SDS in H2O. After each wash the precipitate was recovered by
continuous centrifugation. The pellet became increasingly white with successive
washes as contaminating protein was removed. SlpIII was recovered as a washed
pellet and was dried by lyophilization.

Trypsin Treatment Step of SlpIII 

SlpIII was suspended in 50 mM Tris-HCl, pH 8.0, 0.1 M NaCl buffer, and
was placed in a 37°C water bath, and TPCK-treated trypsin solution was mixed into
the suspension. The final trypsin concentration was 0.1%. After 3 hours, the
solution was centrifuged at 16,000xg for 15 min, the pellet was washed with a half
equal volume of 0.5% SDS in H2O first, then with distilled water. After each wash
the pellet was recovered by centrifugation. The final product was resuspended in
water and kept at 4°C for further analysis.

With the trypsin treatment, SlpIII was purified to 99.4% purity. 

Physical Measurements of SlpIII 

Physical measurements of the purified silk-like proteins have been compared
with those of Bombyx mori silk in order to establish that the repetitive amino acid
polymers produced microbiologically accurately mimic the properties of naturally
occurring polymers. Physical measurements were performed to confirm the model
of anti-parallel chain pleated sheet conformation for the crystalline regions of
Bombyx mori silk fibroin (Marsh, Corey and Pauling, Biochim. Biophys. Acta
(1955) 16; Pauling and Corey, Proc. Natl. Acad. Sci. USA (1953) 39:247).
Preliminary analysis of x-ray diffraction patterns obtained from Slp films is
consistent with those described by Fraser, MacRae, and Stewart (1966) (Table 4).
Circular dichroic (CD) and Fourier transform infrared (FTIR) spectroscopic
analyses of SlpIII were consistent with a high degree of extended β and β-turn
conformations. Comparisons of the spectra obtained from SlpIII with that of
naturally occurring silk fibroin in various solvents (Iizuka and Yang, Proc. Natl.
Acad. Sci. USA (1966) 55:1175) indicated that SlpIII in solution consists of a
mixture of the random and highly ordered structures seen in silk fibroins.



Table 4

Material            a (Å)    b (Å)    c (Å)

(AG)n               9.42     6.95     8.87
(AGAGSG)n           9.39     6.85     9.05
CTP fraction        9.38     6.87     9.13
Native fibroin      9.40     6.97     9.20
                    9.44     6.95     9.30
SlpIII              9.38     6.94     8.97

Referenced in Fraser et al., J. Mol. Biol. (1966) 19:580
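
The comparison in Table 4 can be made quantitative with a short calculation. The sketch below is illustrative only: it takes the native fibroin and SlpIII rows of Table 4 and reports the percent deviation of each SlpIII dimension from the native value. The column names a, b and c for the three dimensions, and the function name `percent_deviation`, are assumptions introduced here.

```python
# Unit-cell dimensions in angstroms, transcribed from Table 4.
# Assumption: the three values per material are the a, b and c dimensions.
NATIVE_FIBROIN = {"a": 9.40, "b": 6.97, "c": 9.20}
SLPIII = {"a": 9.38, "b": 6.94, "c": 8.97}


def percent_deviation(sample, reference):
    """Return the percent difference of each sample dimension from the reference."""
    return {
        axis: 100.0 * (sample[axis] - reference[axis]) / reference[axis]
        for axis in reference
    }


deviation = percent_deviation(SLPIII, NATIVE_FIBROIN)
for axis, pct in deviation.items():
    print(f"{axis}: {pct:+.2f}%")
```

Under these assumptions every SlpIII dimension lies within 3% of the native fibroin value, with the largest difference in the third dimension.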



Example 4 

EBSI Gene Construction:

Six oligonucleotide strands were synthesized and purified as described
previously.

        (HindIII) BanII      StuI
i.   5' AGCTGGGCTCTGGAGTAGGCCTG 3'

ii.  5' AATTCAGGCCTACTCCAGAGCCC 3'
        (EcoRI) StuI     BanII

        (HindIII) BanI
iii. 5' AGCTTGGTGCCAGGTGTAGGAGTTCCGGGTGTAGGCGTTCCGGGAGTTGG
        TGTACCTGGAGTGGGTGTTCCAGGCGTAGGTGTGC 3'

        (XmaI)
iv.  5' CCGGGCACACCTACGCCTGGAACACCCACTCCAGGTACACCAACTCCCGGA
        ACGCCTACACCCGGAACTCCTACACCTGGCACCA 3'
                                    BanI

        (XmaI)                 AhaII
v.   5' CCGGGGTAGGAGTACCAGGGGTAGGCGTCCCTGGAGCGGGTGCTGGTAG
        CGGCGCAGGCGCGGGCTCCGGAGTAGGGGTGCCG 3'
                    BanII          BanI

        (EcoRI) BanI       BanII
vi.  5' AATTCGGCACCCCTACTCCGGAGCCCGCGCCTGCGCCGCTACCAGCACCCG
        CTCCAGGGACGCCTACCCCTGGTACTCCTACC 3'
               AhaII

Oligonucleotide strands (iii), (iv), (v) and (vi) were annealed and ligated
with the DNA of plasmid pBS m13(+) (Stratagene) which had been digested with
HindIII and EcoRI. The products of this ligation reaction were transformed into E.
coli strain JM109. Transformant colonies were selected for resistance to ampicillin.
Colonies were screened for their hybridization with 32P-labelled oligonucleotides
(iii), (v). Plasmid DNA from several positively hybridizing clones was purified and
sequenced. Two of the plasmids, pSY1292 and pSY1293, contained the sequence
shown for oligonucleotides (iii), (v) and (iv), (vi). These sequences contained all of
the nucleotides present in these synthetic oligonucleotides except one. A G:C
basepair was missing at position 7 (iii). The lack of this basepair eliminated one of
the BanI sites. In order to introduce a second BanII site at the 5' end of the gene
fragment, oligonucleotides (i) and (ii) were annealed and ligated with plasmid
pBS m13(+) which had been digested with HindIII and EcoRI. Plasmid DNA from
the transformant colonies resistant to ampicillin was purified. Two plasmids,
pSY1295 and pSY1296, which were digestible with StuI, a unique site contained in
the oligonucleotide sequence, were sequenced. They were both shown to contain the
sequence shown for oligonucleotides (i) and (ii). Plasmid DNA from pSY1292 was
digested sequentially with HindIII, S1 nuclease, and EcoRI. The digestion products
were separated by electrophoresis in an agarose gel and the DNA fragment of
approximately 150 basepairs was excised from the gel. This DNA fragment was
ligated with plasmid DNA pSY1296 which had been digested with StuI and EcoRI.
The products of this ligation reaction were transformed into E. coli strain JM109 and
were selected for resistance to ampicillin. Colonies were screened for hybridization
to 32P-labelled oligonucleotide (v). The plasmid DNA from two positively
hybridizing clones was purified and sequenced. These plasmids were named
pSY1297 and pSY1298. They contained the following sequence:
(HindIII) BanII
AGCTGGGCTCTGGAGTAGGTGTGCCAGGTGTAGGAGTTCCGGGTGTAGGCGTTCCGGGAG   60
TCGACCCGAGACCTCATCCACACGGTCCACATCCTCAAGGCCCACATCCGCAAGGCCCTC

                                      XmaI
TTGGTGTACCTGGAGTGGGTGTTCCAGGCGTAGGTGTGCCCGGGGTAGGAGTACCAGGGG  120
AACCACATGGACCTCACCCACAAGGTCCGCATCCACACGGGCCCCATCCTCATGGTCCCC

                                        BanII
TAGGCGTCCCTGGAGCGGGTGCTGGTAGCGGCGCAGGCGCGGGCTCCGGAGTAGGGGTGC  180
ATCCGCAGGGACCTCGCCCACGACCATCGCCGCGTCCGCGCCCGAGGCCTCATCCCCACG

 EcoRI
CGAATTC
GCTTAAG
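
A listing like the one above can be checked mechanically. The following sketch is not part of the original procedure; it assumes the top strand reads as transcribed here, with characters lost in reproduction restored from the printed bottom strand. It confirms that the two strands are complementary and locates the BanII sites (recognition sequence GRGCYC) that flank the EBSI monomer. The helper names `complement` and `find_sites` are hypothetical.

```python
import re

# Top strand (5'->3') and bottom strand (3'->5') as printed in the listing.
TOP = (
    "AGCTGGGCTCTGGAGTAGGTGTGCCAGGTGTAGGAGTTCCGGGTGTAGGCGTTCCGGGAG"
    "TTGGTGTACCTGGAGTGGGTGTTCCAGGCGTAGGTGTGCCCGGGGTAGGAGTACCAGGGG"
    "TAGGCGTCCCTGGAGCGGGTGCTGGTAGCGGCGCAGGCGCGGGCTCCGGAGTAGGGGTGC"
    "CGAATTC"
)
BOTTOM = (
    "TCGACCCGAGACCTCATCCACACGGTCCACATCCTCAAGGCCCACATCCGCAAGGCCCTC"
    "AACCACATGGACCTCACCCACAAGGTCCGCATCCACACGGGCCCCATCCTCATGGTCCCC"
    "ATCCGCAGGGACCTCGCCCACGACCATCGCCGCGTCCGCGCCCGAGGCCTCATCCCCACG"
    "GCTTAAG"
)

PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq):
    """Base-by-base complement, without reversing the strand."""
    return "".join(PAIRS[base] for base in seq)


def find_sites(seq, pattern):
    """0-based start positions of every (possibly overlapping) match."""
    return [m.start() for m in re.finditer(f"(?={pattern})", seq)]


BANII = "G[AG]GC[CT]C"  # GRGCYC
banii_sites = find_sites(TOP, BANII)
monomer_length = banii_sites[1] - banii_sites[0]

print("strands complementary:", complement(TOP) == BOTTOM)
print("BanII sites at:", banii_sites, "monomer length:", monomer_length)
```

The distance between the two BanII sites in this transcription is 156 bp, consistent with the fragment of approximately 150 base pairs that is excised in the assembly step.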



EBSI Multimer Gene Assembly:

The BanI acceptor plasmid pSY937 was modified in order to accept BanII
terminal cohesive DNA fragments. Two oligonucleotides were synthesized for this
purpose.

(BamHI) DraI SspI NruI BanII
vii . q ' £fflfi<^&TG TTTAAATATT CTCGCG]&CGTTTTTCThTGQQ£X£GATGTGT
TnrrnTC^^ATG GATATCAGCTG 3 '
FspI EcoRV PvuII

(BamHI) PvuII EcoRV FspI BanII
viii . «; ^&TrrAGrTGATAC CATGCGCAGGGTAACACATCGAGCCCATACAAAAA
rr;TT^QC<?AG AATATTTAAA CATAG3 '
NruI SspI DraI

Oligonucleotides (vii) and (viii) were annealed and ligated with plasmid
DNA pSY937 which was digested with BamHI. The products of this ligation were
transformed into E. coli strain JM109 and colonies were selected for resistance to
chloramphenicol. Transformant colonies were screened by hybridization to
32P-labelled oligonucleotide (vii). Plasmid DNA from two positively hybridizing
clones, pSY1299 and pSY1300, contained the sequence shown for oligonucleotides
(vii) and (viii), as determined by DNA sequencing.

Plasmid DNA pSY1298 was digested with BanII and the digestion fragments
separated by agarose gel electrophoresis. The EBSI gene fragment, approximately
150 base pairs, was excised and purified by electro-elution and ethanol precipitation.
Approximately 1 µg of purified fragment was self-ligated in order to produce
multimers ranging in size from 450 bp to 6,000 bp. The products of the
self-ligation were then ligated with plasmid DNA pSY1299 which had been digested with
BanII. The products of this ligation reaction were transformed into E. coli strain
HB101. Transformants were selected for resistance to chloramphenicol. Plasmid
DNA from individual transformants was purified and analyzed for increased size due
to EBSI multimer DNA insertions. Ten clones (pSY1240-1249) with inserts ranging
in size from 1.5 Kbp to 4.4 Kbp were obtained.

Expression of EBSI Multimer Gene:

One of these clones, pSY1248, which contained a 4 Kb EBSI multimer gene,
was recloned in the λPR expression vector, pSY751. Plasmid DNA from pSY1248
was digested with NruI and PvuII, separated by agarose gel electrophoresis, and the
DNA band corresponding to the EBSI multimer gene was excised and purified by
NACS purification. DNA from plasmid pSY751 was digested with PvuII and
ligated with the NruI-PvuII fragment from pSY1248. The products of this ligation
were transformed into E. coli HB101, and the transformants selected for resistance
to ampicillin. Two clones were isolated containing the new plasmid pSY1280. E.
coli cells containing pSY1280 were grown at 30°C to an OD600 of 0.7 and then
shifted to 42°C for 1.5 hours. The proteins produced by these cells were analyzed by
SDS-PAGE. The separated proteins were transferred to nitrocellulose paper and
detected by immunoreactivity with anti-ELP rabbit serum. A strongly reactive
protein band was observed with an apparent molecular weight of 120 kDal.

The ampicillin drug resistance gene of pSY1280 was substituted with the
kanamycin marker and the subsequent plasmid was called pSY1332. This plasmid
was used in fermentation for the purification of EBSI (see Methods).


pSY1332/pSY1280 EBSI Protein 1413 AA MW 113,159

MDPVVLQRRDWENPGVTQLNRLAAHPPFASERFCMGS
[(GVGVP)8 (GAGAGSGAGAGS)1]26
MCYRAHGYQLSAGRYHYQLVWCQK
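
The stated length and molecular weight offer a way to check the repeat formula. The sketch below is an illustration, not part of the original disclosure: it assumes a head sequence beginning MDPVVLQ, a repeat unit of (GVGVP)8 followed by one GAGAGSGAGAGS block, and 26 repeats, all of which are inferred here from the 1413 AA total and the 156 bp monomer. Residue masses are standard average values; the function name `average_mass` is hypothetical.

```python
# Standard average residue masses in daltons (amino acid minus water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.0153  # one water per chain for the free termini

HEAD = "MDPVVLQRRDWENPGVTQLNRLAAHPPFASERFCMGS"
TAIL = "MCYRAHGYQLSAGRYHYQLVWCQK"
REPEAT_UNIT = "GVGVP" * 8 + "GAGAGSGAGAGS"  # assumed subscripts 8 and 1
N_REPEATS = 26                              # assumed repeat count


def average_mass(sequence):
    """Average molecular mass of an unmodified polypeptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


ebsi = HEAD + REPEAT_UNIT * N_REPEATS + TAIL
print(len(ebsi), "residues")
print(f"{average_mass(ebsi):,.0f} Da")
```

With these assumptions the sequence has 1413 residues and a calculated mass within a few daltons of the stated 113,159, which supports the inferred subscripts without proving them.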
Purification of EBSI Protein:

E. coli strain HB101 containing plasmid pSY1280 was fermented in 10 L
volume. The cells were concentrated by filtration and further harvested by
centrifugation. Pelleted cells were stored frozen at -70°C until processed. Frozen
cells were thawed on ice and suspended in 4 ml of 50 mM Tris-HCl pH 7.0, 10 mM
EDTA, 5 mM PMSF per gram wet weight of cells. The cells were broken by
French pressing twice at 15,000 psi and then cooled to 0°C. The crude lysate was
cleared by centrifugation at 26,000xg for 20 minutes. The supernatant proteins were
precipitated by addition of solid ammonium sulfate to 20% of saturation (114 g/L).
The precipitate was collected by centrifugation at 10,000xg for 10 minutes. The
pellet was resuspended in 10 ml of H2O and dialyzed against 10 mM Tris pH 8.0,
0.15 M NaCl at 4°C. The dialyzed solution was digested with 0.1% trypsin
(Sigma) for 1.5 hours at room temperature, and reprecipitated with 20% ammonium
sulfate. The precipitated protein was resuspended in H2O and dialyzed against 10
mM Tris pH 7.0, 1 mM EDTA at 4°C. The protein purity of this sample was
analyzed by amino acid composition and determined to be 83%.

Plastic Properties of EBSI Protein: 

The soluble preparation of semi-purified EBSI protein described above was
incubated at 37°C for 30 minutes and centrifuged at 10,000xg for 10 min at room
temperature. This treatment caused the EBSI protein to aggregate, become
insoluble, and pellet into a translucent solid. The solid was resistant to mechanical
disruption either by vortexing or by maceration using a glass rod. The solid could
be cut with a razor blade into strips which exhibited a high degree of elasticity.
These strips fully retained their shape after repeated extensions and relaxations.
They resisted compression with no apparent irreversible deformation of structure.

EBSI Purification 

EBSI sample (~70% pure) was dialyzed in 50 mM Tris-HCl, 50 mM NaCl,
pH 8.0 at 4°C overnight with one change of buffer. If precipitation was observed,
the sample was centrifuged at 27,000xg for 15 min at 4°C. All remaining steps
were performed at 4°C. The supernatant was applied to a DEAE-Sephacel column
which had been equilibrated with 50 mM Tris-HCl, 50 mM NaCl, pH 8.0. The
flow-through fractions which contained EBSI were collected and pooled. NaCl was
added to the pooled fractions from the DEAE-Sephacel column to make a final
concentration of 2 M NaCl in the sample. Insoluble material was removed by
centrifugation at 27,000xg for 20 minutes. The supernatant was then loaded onto a
Phenyl-Sepharose column which was equilibrated with 50 mM sodium phosphate
buffer, pH 7.0, with 2 M NaCl. The column was washed extensively with buffer
until no eluting protein was detected by A280. The column was then eluted stepwise
with 50 mM sodium phosphate buffer, pH 7.0 and finally with water. The EBSI
active fractions were pooled and stored at 4°C for further analysis.

With the addition of these steps to the previous procedures, 100% pure EBSI
was obtained.

Example 5 

ELPI Construction and Expression

Two oligonucleotide strands were synthesized and purified as described in
the Methods section.

       (EcoRI) BanI                          SmaI
i)  5' AATTCGGTGCCCGGTGTAGGAGTTCCGGGTGTAGGCGTTCCCGGGGTAG
       GCGTTCCGGGAGTAGGGGTGCCA 3'
                         BanI

            BanI                             SmaI
ii) 3' GCCACGGGCCACATCCTCAAGGCCCACATCCGCAAGGGCCCCATCCGCA
       AGGCCCTCATCCCCACGGTTCGA 5'
                     BanI (HindIII)

The two oligonucleotide strands were annealed and ligated with the DNA of
plasmid pBS m13(+) (Stratagene) which had been digested with RENs HindIII and
EcoRI.

The products of this ligation reaction were transformed into E. coli strain
JM109. Transformant colonies were screened for their hybridization with
32P-labeled oligonucleotide (i). Plasmid DNA from positively hybridizing clones was
purified and sequenced. One plasmid, pSY1287, contained the sequence shown for
oligonucleotides (i) and (ii).
Plasmid DNA from pSY1287 was digested with BanI REN and the digestion
fragments were separated by agarose gel electrophoresis. The ELPI gene fragment,
approximately 60 bp, was excised and purified by NACS column. Approximately 1
µg of purified fragment was self-ligated in order to produce multimers ranging in
size from 300 bp to 5000 bp.

The products of the self-ligation were then ligated with plasmid DNA
pSY937 which had been digested with REN BanI. The product of this ligation
reaction was transformed into E. coli strain HB101. Transformants were selected
for resistance to chloramphenicol. Plasmid DNA from individual transformants was
purified and analyzed for increased size due to ELPI multiple DNA insertions. Four
clones (pSY1388-1391) with inserts ranging in size from 1.0 kbp to 2.5 kbp were
obtained. These clones were recloned in the λPR expression vector pSY751. The
clones obtained (pSY1392-1395) were used for expression of ELPI.



The ELPI protein had the following amino acid composition:

pSY1395 ELPI Protein 859 AA MW 72,555

MDPVVLQRRDWENPGVTQLNRLAAHPPFARNILAIRW
[(VPGVG)4]40 VPWTRVDLSAGRYHYQLVWCQK
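
Because purity is assessed elsewhere in these examples by amino acid composition, it can help to tabulate the composition expected from the formula. The sketch below is illustrative: it assumes the head sequence begins MDPVVLQ and expands [(VPGVG)4]40 literally, then counts residues and reports mole percent. The variable names are introduced here and are not from the original text.

```python
from collections import Counter

HEAD = "MDPVVLQRRDWENPGVTQLNRLAAHPPFARNILAIRW"
TAIL = "VPWTRVDLSAGRYHYQLVWCQK"
REPEAT_BLOCK = "VPGVG" * 4   # (VPGVG)4
N_BLOCKS = 40                # [(VPGVG)4]40

elpi = HEAD + REPEAT_BLOCK * N_BLOCKS + TAIL
counts = Counter(elpi)

# Mole percent of each residue, most abundant first.
mole_percent = {
    aa: 100.0 * n / len(elpi) for aa, n in counts.most_common()
}

print(len(elpi), "residues")
for aa in ("V", "G", "P"):
    print(f"{aa}: {counts[aa]} residues, {mole_percent[aa]:.1f} mol%")
```

Under these assumptions the chain has 859 residues, matching the stated length, and valine, glycine and proline together account for well over 90% of the residues.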



SELP1 Gene Construction and Expression

Two oligonucleotide strands were synthesized and purified as described in
the Methods section.

            FspI  PvuII  SnaBI  (PstI)
(i)   5' - GTGCGCAGCTGGTACGTAGCTGCA - 3'

           (PstI)     PvuII
(ii)  3' - ACGTCACGCGTCGACCATGCATCG - 5'
               FspI        SnaBI

These oligonucleotide strands were annealed and ligated with plasmid
pSY1304 which had been digested with PstI REN (pSY1304 differs from pSY857 by
having a monomeric unit in place of the trimeric unit of pSY857). Plasmid DNA
from transformant colonies resistant to chloramphenicol was purified. One plasmid,
pSY1365, which was digestible with REN SnaBI, was sequenced and proven to be
correct.
ELPI gene fragment purified as described (ELPI construction and
expression) was treated with Mung Bean Nuclease as described by the supplier
(Stratagene). The DNA fragment mixture was then ligated with plasmid DNA
pSY1365 which had been digested sequentially with RENs FspI, SnaBI and calf
intestinal phosphatase. The products of this ligation reaction were transformed into
E. coli strain HB101 and were selected for resistance to chloramphenicol. Plasmid
DNA from individual transformants was purified and analyzed for the ELPI
monomer DNA insertion. Two plasmids, pSY1366 A and B, were sequenced.
They were both shown to contain the ELPI DNA sequence in the correct orientation.

Plasmid DNA pSY1365 was digested with REN BanI and the DNA fragment
containing the SELP1 monomer was gel purified. To create multimers, 1 µg of the
SELP1 DNA fragment was self-ligated. Multimers were obtained ranging in size
from 500 bp to 10 kbp. The SELP1 multimers were cloned into the BanI site of
pSY1262.

Positive clones were characterized by gel electrophoresis for the size of the
inserted multimer and used for expression and protein analysis.



pSY1396 SELP1 Protein 2025 AA MW 148,212 

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS (GAGAGS)4

[GAA (VPGVG) , VAAGY (GAGAGS) ,] „

GAA (VPGVG)4 VAAGY (GAGAGS)2 GAGAMDPGRYQLSAGRYHYQLVWCQK



SELP2 - Monomer Construction 
Plasmid DNA pSY1298 was digested with BanII REN and the EBSI gene
fragment was purified as described previously. The EBSI monomer fragment was
ligated into pSY1304 (pSY937 containing a monomer of SlpIII, constructed as
pSY857) which had been digested with BanII REN and treated with calf intestinal
phosphatase.

The products of the ligation mixture were transformed into E. coli strain
HB101. Transformants were selected for resistance to chloramphenicol. After
restriction analysis of several isolates, one plasmid was chosen, pSY1301,
containing a DNA fragment corresponding to the SELP2 monomer gene.
SELP2 - Multimer Gene Assembly and Expression

Plasmid DNA pSY1301 was digested with REN BanI and the DNA fragment
containing the SELP2 "monomer" was gel purified. To create multimers, 1 µg of
the SELP2 DNA fragment was self-ligated. Multimers were obtained greater than
12 kb in size.

The SELP2 multimers were cloned into the BanI site of pSY1262. Positive
clones were characterized by gel electrophoresis for the size of the inserted
multimer. The clones with inserts ranging in size from 1.5 kb to 11 kb were
selected. Plasmid DNA pSY1372 containing an insert of 6 kb (18 repeats) was used
for further analysis and protein purification.
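
The relation between insert size and repeat number is simple arithmetic, sketched below for illustration. The repeat unit is taken as (GAGAGS)6 GAAGY (GAGAGS)5 (GVGVP)8; the subscript 8 on the (GVGVP) block is an assumption carried over from the EBSI monomer, and the helper names are hypothetical.

```python
BASES_PER_CODON = 3

# Assumed SELP2 repeat unit; the (GVGVP) count of 8 is an inference.
REPEAT_UNIT = "GAGAGS" * 6 + "GAAGY" + "GAGAGS" * 5 + "GVGVP" * 8
UNIT_BP = len(REPEAT_UNIT) * BASES_PER_CODON


def insert_size_bp(n_repeats):
    """Length in base pairs of a head-to-tail multimer of n repeat units."""
    return n_repeats * UNIT_BP


def repeats_for_insert(size_bp):
    """Nearest whole number of repeat units for a measured insert size."""
    return round(size_bp / UNIT_BP)


print(len(REPEAT_UNIT), "aa per unit,", UNIT_BP, "bp per unit")
print("18 repeats:", insert_size_bp(18), "bp")
print("6 kb insert:", repeats_for_insert(6000), "repeats")
```

On these assumptions one unit is 111 amino acids (333 bp), so 18 repeats give 5,994 bp, in agreement with the 6 kb insert reported for pSY1372.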

SELP2 - Protein Purification

E. coli strain HB101 containing plasmid pSY1372 was fermented according
to the procedure described in Methods for fermentation. The cells were harvested
by centrifugation. Pelleted cells were stored frozen at -70°C until processed.
Frozen cells were thawed on ice and suspended in 4 ml of 50 mM Tris-HCl, pH 7.0,
10 mM EDTA, 5 mM PMSF per gram wet weight of cells. The cells were broken
by passing through a Gaulin cell disrupter at 8,000 psi. The crude lysate was
cleared by centrifugation at 26,000xg for 20 minutes. The supernatant, which
contained >75% of the SELP2 protein, was precipitated by addition of 20%
ammonium sulfate (114 g/L). The precipitate was collected by centrifugation at
10,000xg for 10 minutes. The pellet was resuspended in 10 ml of H2O and dialyzed
against 10 mM Tris pH 8.0, 0.15 M NaCl at 4°C. The dialyzed material was
centrifuged at 26,000xg for 15 min in order to collect the insoluble fraction of
protein which contained approximately 10% of the SELP2 protein. This insoluble
protein pellet was washed twice in 0.2% SDS at 50°C for 30 min with occasional
shaking. The insoluble protein was collected each time by centrifugation at
26,000xg for 15 min followed by a wash of 50% ethanol. The final protein pellet
was resuspended in water and analyzed by Western blot analysis and amino acid
composition. By Western blot the SELP2 protein appears to be homogeneous in size
consistent with its large molecular weight (>150 kDal). By amino acid composition
the SELP2 preparation is approximately 80% pure and the observed molar ratio of
amino acids (Ser:Gly:Ala:Pro:Val:Tyr) agrees very closely with the expected
composition as predicted from the SELP2 sequence present in pSY1372.

pSY1372 SELP2 Protein 2055 AA MW 152,354

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS (GAGAGS)2 (GVGVP)8
[(GAGAGS)6 GAAGY (GAGAGS)5 (GVGVP)8]17
(GAGAGS)6 GAAGY (GAGAGS)2 GAGAMDPGRYQLSAGRYHYQLVWCQK

SELP3 - Construction and Expression 

Plasmid DNA pSY1301 was partially digested with REN HaeII and the
digestion fragments separated by agarose gel electrophoresis. The larger DNA
fragments were excised and purified by NACS column. The purified fragments
were self-ligated, the ligation reaction was heated at 70°C for 15 min to inactivate
the T4 DNA ligase and eventually digested with REN PstI. The digestion mixture
was then transformed into E. coli strain JM109. Transformants were selected for
resistance to chloramphenicol. Plasmid DNA from individual transformants was
purified and analyzed for: (1) resistance to REN PstI; and (2) deletion of 60 bp
HaeII fragment contained within the SELP2 gene fragment. One clone (pSY1377)
satisfied both requirements. Plasmid DNA from pSY1377 was digested with REN
BanI and the DNA fragment containing the SELP3 monomer was gel purified. To
create multimers, 1 µg of the SELP3 DNA fragment was self-ligated. Multimers
were obtained ranging in size from 500 bp to 10 kbp. The SELP3 multimers were
cloned into the BanI site of pSY1262. Positive clones were characterized by gel
electrophoresis for the size of the inserted multimer and used for expression and
protein analysis.

pSY1397 SELP3 Protein 2257 AA MW 168,535

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS (GAGAGS)2
[(GVGVP)8 (GAGAGS)8]24
(GVGVP)8 (GAGAGS)5 GAGAMDPGRYQLSAGRYHYQLVWCQK
SLP4 - Construction and Expression

Plasmid DNA from pSY1304 was partially digested with REN HaeII and the
digestion fragments separated by agarose gel electrophoresis. The larger DNA
fragments were excised and purified by NACS column. The purified fragments
were self-ligated, the ligation reaction was heated at 70°C for 15 min to inactivate
the T4 DNA ligase and eventually digested with REN PstI. The digestion mixture
was then transformed into E. coli strain JM109. Transformants were selected for
resistance to chloramphenicol. Plasmid DNA from individual transformants was
purified and analyzed for: (1) resistance to REN PstI; and (2) deletion of 60 bp
HaeII fragment contained within the SELP2 gene fragment. One clone (pSY1378)
satisfied both requirements. Plasmid DNA pSY1378 was digested with REN BanI
and the DNA fragment containing the SLP4 monomer was gel purified. To create
multimers, 1 µg of SLP4 DNA was self-ligated. Multimers were obtained ranging
in size from 300 bp to 6 kbp. The SLP4 multimers were cloned into the BanI site of
pSY1262. Positive clones were characterized by gel electrophoresis for the size of
the inserted multimer and used for expression and protein analysis.

pSY1398 SLP4 Protein 1101 AA MW 76,231 

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS [(GAGAGS)6]27
(GAGAGS) , GAGAMDPGRYQLSAGRYHYQLVWCQK

FCB-SLPIII (SLPF) Construction and Expression:

The SLPIII polymer was chosen as a backbone structure for insertion of a
biologically functional sequence because of its predicted structure, allowing for
fabrication of useful products; having good structural properties for use in a wide
variety of applications; having β-turn structures between interactive strands; and
allowing for substitution of the turn sequences with other sequences. The
fibronectin cell-binding domain, amino acids 1405-1512, has a strong turn
propensity, with the tripeptide RGD providing for cell attachment, predicted to be
present within a hydrophilic loop between adjacent β-strands. A 10 amino acid
sequence spanning this proposed loop structure (referred to as fibronectin
cell-binding or FCB sequence) was chosen to constitute the functional block of amino
acids to be inserted within the SLPIII backbone. The insertion site within the
SLPIII backbone was chosen to correspond with the amino-acid sequence GAAGY
which is also predicted to provide a turn structure (Chou and Fasman,
Biochemistry, 13:222-244 (1974)). The design allows for conservation of the FCB
structure while causing minimal disruption of the SLPIII (GAGAGS)9 β-strand
crystal-packing domains.

The SLPIII gene monomer contains a PstI restriction endonuclease site
within the sequence encoding the proposed turn structure, GAAGY. This site was
used to insert the synthetic DNA encoding the 10 amino acids of the FCB sequence.
Two complementary DNA strands comprising the FCB site, 36 bases in length, were
synthesized consisting of the sequence shown below:

5' - GTGACTGGCCGTGGTGATAGCCCGGCTAGCGCTGCA -3' 

3' - ACGTCACTGACCGGCACCACTATCGGGCCGATCGCG -5' 
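
The design of this oligonucleotide pair can be verified with a few lines of code. The sketch below is illustrative and not part of the original procedure: it checks that the two strands pair over 32 bases, leaving 4-base TGCA 3' overhangs compatible with a PstI-cut vector, and translates the top strand. The codon table is deliberately partial (only the codons that occur in this oligonucleotide), and the helper names are hypothetical.

```python
TOP_5_TO_3 = "GTGACTGGCCGTGGTGATAGCCCGGCTAGCGCTGCA"
BOTTOM_3_TO_5 = "ACGTCACTGACCGGCACCACTATCGGGCCGATCGCG"

PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Partial codon table: only the codons used in the FCB oligonucleotide.
CODONS = {
    "GTG": "V", "ACT": "T", "GGC": "G", "CGT": "R", "GGT": "G",
    "GAT": "D", "AGC": "S", "CCG": "P", "GCT": "A", "GCA": "A",
}


def complement(seq):
    """Base-by-base complement, without reversing the strand."""
    return "".join(PAIRS[base] for base in seq)


def translate(seq):
    """Translate a sequence read in frame from its first base."""
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


OVERHANG = 4
duplex_ok = complement(TOP_5_TO_3[:-OVERHANG]) == BOTTOM_3_TO_5[OVERHANG:]
top_overhang = TOP_5_TO_3[-OVERHANG:]            # 3' end of the top strand
bottom_overhang = BOTTOM_3_TO_5[:OVERHANG][::-1]  # 3' end, read 5'->3'
peptide = translate(TOP_5_TO_3)

print("paired region complementary:", duplex_ok)
print("3' overhangs:", top_overhang, bottom_overhang)
print("encoded peptide:", peptide)
print("NheI site (GCTAGC) present:", "GCTAGC" in TOP_5_TO_3)
```

Read in this frame the top strand encodes VTGRGDSPASAA, which contains the RGD tripeptide, and it carries the NheI site later used to screen for the insert.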

These oligonucleotides were purified according to the procedures described
in Example 1, and cloned into the PstI site of pSY1304. pSY1304 DNA was
digested with PstI and ligated with a mixture of the FCB oligonucleotides. The
ligation reaction products were transformed into E. coli cells. Colonies containing
the plasmid were selected on bacterial culture plates containing the antibiotic
chloramphenicol. Individual colonies were grown and plasmid DNA purified and
analyzed for the presence of the FCB oligonucleotide sequence by restriction
digestion with NheI. Plasmids containing this restriction site were subjected to
DNA sequencing and two candidates were shown to be correct. The partial
nucleotide sequence of one of these, pSY1325, and the encoded amino-acid sequence
were as follows:
          Pst I
TCT GGA GCT GCA GTG ACT GGC CGT GGT GAT AGC CCG GCT
 S   G   A   A   V   T   G   R   G   D   S   P   A

      Pst I
AGC GCT GCA GGC TAT GGA GCT GGC GCT GGC TCA GGT GCT
 S   A   A   G   Y   G   A   G   A   G   S   G   A

                          Ban I
GGA GCA GGA AGC GGA GCG GGT GCC
 G   A   G   S   G   A   G

The FCB-SLP monomer gene fragment was purified from pSY1325 by
digestion with BanI, agarose-gel electrophoresis, and NACS purification (Example
1). The monomer gene fragment was self-ligated and cloned into pSY937 which
had been digested with BanI. The products of this ligation were transformed into E.
coli and selected for growth on chloramphenicol. Plasmid DNA from individual
colonies was analyzed for inserts containing multiple FCB-SLP monomer fragments
by digestion with NruI and EcoRV and electrophoresis on agarose gels. One clone
was identified containing two inserts, one of approximately 2.1 kb and the other of
2.8 kb. Both inserts were cloned individually and transferred to the expression
vector pSY751. Plasmid pSY1325 was digested with NruI and PvuII and the 2.1
and 2.8 kb insert bands were purified. These DNA fragments were ligated with
pSY751 that had been digested with PvuII. The products of this reaction were
transformed into E. coli and selected for growth on the antibiotic ampicillin.
Plasmid DNA from individual colonies was analyzed by restriction digestion for the
presence of the FCB-SLP polymer gene. Two clones were identified, pSY1520 and
1521, containing the 2.1 and the 2.8 kb inserts, respectively.

E. coli cells containing pSY1520 and pSY1521 were grown at 30°C in LB
medium containing 50 µg/ml ampicillin to an OD600 of 0.7. Production of the
FCB-SLP polymer proteins was induced by increasing the culture temperature to 42°C
for 1.5 hours. The cells were harvested by centrifugation and lysed in sample buffer
containing sodium dodecylsulfate (SDS) and β-mercaptoethanol by heating at 100°C
for 5 minutes. Samples of these lysates corresponding to 5 x 10^8 cells were applied
to an 8% polyacrylamide gel containing SDS, electrophoresed, and transferred to
nitrocellulose filters by electroblotting. The filters were incubated either with
anti-SLP or anti-FCB peptide antibody. Specific immunoreactivity with the anti-SLP
antibody was observed for a protein band of approximately 75 kd in lysates of
pSY1520, 95 kd in lysates of pSY1521, and 120 kd in lysates of the SLPIII clone
pSY1186. Reactivity with the anti-FCB antibody was observed only for the two
FCB-SLP polymer bands.

pSY1520 FCB-SLPIII 767 AA MW 57,467

(fM) DPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
GAGS (GAGAGS) t GAAVTGRGDSPASAAGY
[(GAGAGS)9 GAAVTGRGDSPASAAGY] ,
GAGAGSGAGAGSGAGAMDPGRYQLSAGRYHYQLVWCQK

Plasmid pPT0134 Construction:

Two oligonucleotide strands containing multiple cloning sites (MCS) were
synthesized and purified as described in Example 1.

                  FokI                    FokI   ScaI
0.A) 5' - GTGCTGCGGATGCTCGAGATGGTGCATGCATGTACATCCGAGTACTTCGAT

0.B) 3' -     ACGCCTACGAGCTCTACCACGTACGTACATGTAGGCTCATGAAGCTA

After annealing, the two oligonucleotide strands were ligated with pSY937
which had been digested with BanI and EcoRV RENs. The product of the ligation
mixture was transformed into E. coli and selected on bacterial plates containing the
antibiotic chloramphenicol. Plasmid DNA from individual colonies was analyzed on
agarose gel electrophoresis after digestion with ScaI and StuI RENs. One plasmid,
pPT0124, contained the expected DNA fragment.

The new MCS were then moved to plasmid pSY1367. This plasmid is a
derivative of pSY1299, which was digested with NciI REN and the large DNA
fragment was purified by agarose gel electrophoresis and NACS purification. The
purified DNA fragment was treated with DNA Polymerase (Example 1), ligated,
then digested with FokI prior to transformation in E. coli strain HB101. Plasmid
DNA from single colonies was purified and analyzed by restriction digests. One
plasmid, pSY1366, was found to be correct and lacking the only FokI site present in
pSY1299.

Two oligonucleotide strands were synthesized and purified as described in
Example 1:

            (BanII)            FokI
1.A) 5' -     CTACATGTGTTACACATCCCGTGC

1.B) 3' - CCGAGATGTACACAATGTGTAGGGCACG

Oligonucleotide strands 1.A and 1.B were annealed and ligated with the
DNA of plasmid pSY1366 which had been digested with BanII and FspI RENs. The
products of this ligation reaction were transformed into E. coli strain HB101.
Plasmid DNA from transformed colonies was purified and digested with FokI.
Clones which linearized with FokI were sequenced. Plasmid pSY1367 contained the
desired MCS sequence and was chosen for subsequent constructions.

Plasmids pPT0124 and pSY1367 were digested with NruI and NcoI and the
DNA fragments were purified by agarose gel electrophoresis and NACS
purification. The small fragment (approximately 500 bp) from pPT0124 was ligated
with the large fragment from pSY1367. The product of the ligation mixture was
transformed into E. coli. Plasmid DNA from single colonies was purified and
analyzed by restriction digests and DNA sequencing. One plasmid, pPT0134,
contained the desired sequence and was used as the acceptor vector for further DNA
constructions.

SELPF Construction and Expression:

Plasmid DNA pSY1521 was digested with BanI REN and the SLPF
(FCB-SlpIII) monomer was purified using NACS column (see Example 1). The DNA
fragment was ligated with pPT0134 previously digested with FokI REN, treated
with calf intestinal phosphatase (see Example 1), and subsequently purified using
NACS column. The product of this ligation reaction was transformed into E. coli
strain HB101. Transformants were selected for resistance to chloramphenicol.
Plasmid DNA from transformed colonies was purified and digested with FokI.
Clones with the correct restriction pattern were sequenced. Plasmid pPT0141
contained the desired SLPF monomer sequence and was chosen for subsequent
constructions.

Plasmid pSY1377 was digested with BanI REN and the SELP3 gene
monomer DNA fragment was purified by agarose gel electrophoresis followed by
a NACS column. The purified SELP3 gene monomer, 268 bp, was ligated with
plasmid DNA pPT0141 previously digested with BanI REN and purified using
a NACS column. The product of this ligation reaction was transformed into E. coli
strain HB101. Transformants were selected for resistance to chloramphenicol.
Plasmid DNA from transformed colonies was purified and digested with FokI.
Clones with the correct restriction pattern were sequenced. Plasmid pPT0146
contained the desired SELPF monomer DNA.

Plasmid DNA from pPT0146 was digested with FokI REN and the digestion
fragments were separated by agarose gel electrophoresis. The SELPF gene
fragment, 477 bp, was excised and purified by NACS column (see Example 1). The
purified fragment was ligated with plasmid pSY1262 which had been digested with
REN BanI. The product of this ligation reaction was transformed into E. coli strain
HB101. Transformants were selected for resistance to kanamycin. Plasmid DNA
from individual transformants was purified and analyzed for increased size due to
SELPF multiple DNA insertion. Several clones were obtained ranging in size from
1 kbp to 6 kbp. One clone, pPT0183, with an insert of approximately 2.9 kbp, was
chosen for expression and protein analysis.

E. coli strain HB101 containing plasmid pPT0183 was grown as described in
Example 1. The protein produced by these cells was analyzed by SDS-PAGE for
detection of reactivity to SLP and ELP antibodies. In every analysis a strong
reactive band was observed with an apparent molecular weight of approximately 100
kD.

pPT0183 SELPF 1011 AA MW 79,597

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
30 GAGS (GAGAGS ) 7 (GVGVP) , 

t (GAGAGS ) i ? G AAVTGRGDS P AS AAG Y (GAGAGS) 5 (GVGVP) ( ] 4 
(GAGAGS ) x ?G AAVTGRGD S PAS AAG Y (GAGAGS ) 7 
GAGAMDPGRYQLS AGR YHYQLVWCQK 

Example 6

Plasmid pPT0285 Construction:

Plasmid pACYC184 (Chang, A.Y.C. and Cohen, S.N., J. Bacteriol.,
134:1141-1156 (1978)) was digested with BanI REN, purified by agarose gel
electrophoresis, and the DNA fragment corresponding to approximately 2,000 bp
was further purified using a NACS column. This DNA fragment was filled in using
DNA polymerase (see Example 1) and then self-ligated. The products of the
ligation mixture were transformed into E. coli strain HB101 and selected on
bacterial plates containing chloramphenicol at 30 µg/ml. Plasmid DNA from
individual colonies was linearized by digestion with Eco47III. One clone, pPT0235,
was used as the acceptor vector for subsequent DNA manipulations.

Two oligonucleotide strands were synthesized and purified as described in 
Example 1: 



(Eco47III) PmeI PmlI NruI BanI

StuI EcoRV SnaBI (SnaI)

1. 5'- GCTATGTTTAAACCACGTGTTCGCGATCCGGGTGCCGATCCAGGCCTGCGATATCAGTACGTA
2. 3'- CGATACAAATTTGGTGCACAAGCGCTAGGCCCACGGCTAGGTCCGGACGCTATAGTCATGCAT
       AMFKPRVRDPGADPGLRYQYV


The two oligonucleotide strands were annealed and ligated with the DNA of
plasmid pPT0235 which had been digested with Eco47III and SnaI RENs. The
product of this ligation reaction was transformed into E. coli strain HB101. Plasmid
DNA from transformants was purified and digested with EcoRI in combination with
Eco47III or SnaI or NruI RENs. Plasmid DNA from two clones that gave the
correct digestion pattern was sequenced. One plasmid, designated pPT0285, was
found to be correct and chosen for further constructions.



CLP 3.7 Construction and Expression:
One oligonucleotide strand coding for the CLP 3.7 gene monomer (see Table
5) was synthesized using an Applied Biosystems DNA synthesizer model 381A and a
2000 Å synthesis column supplied by Glen Research. After the synthesis, the 226
base DNA fragment was deprotected and cleaved from the column support by
treatment in NH4OH at 55°C for 6 hours.



Table 5

5'- ATGGCAGCGAAAGGGGACCGGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCAGGG
GCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCGGGTGCTCCGGGAACTCCTGGCCCGC
AGGGCTTGCCGGGATCCCCAGGTGCACCAGGAACGCCGGGACCTCAGGGTCTTCCGGGTAGCCCTGG
TGCCTTTCCGCTAAAGTCCTGCCGT -3'

Two additional DNA strands were synthesized to be used as primers for PCR
amplification. The synthesis and purification of these DNA primers was performed
as described in Example 1. The two strands are:

1. 5'- AAG AAG GAG ATA TCA TAT GGC AGC GAA AGG GGA CC -3'

2. 5'- CGC AGA TCT TTA AAT TAC GGC AGG ACT TTA GCG GAA A -3'


The PCR reaction was performed as described in Example 1. 
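
The relationship between the two primers and the Table 5 oligonucleotide can be checked with a short script. The following is a minimal sketch, not part of the described method: it assumes the Table 5 sequence with the scan-induced spaces removed, and the function names are hypothetical. It measures how many bases at the 3' end of each primer match the ends of the template, and derives the length of the amplified product from those overlaps.

```python
# CLP 3.7 oligonucleotide as printed in Table 5 (scan-induced spaces removed).
TEMPLATE = (
    "ATGGCAGCGAAAGGGGACCGGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCAGGG"
    "GCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCGGGTGCTCCGGGAACTCCTGGCCCGC"
    "AGGGCTTGCCGGGATCCCCAGGTGCACCAGGAACGCCGGGACCTCAGGGTCTTCCGGGTAGCCCTGG"
    "TGCCTTTCCGCTAAAGTCCTGCCGT"
)
PRIMER_1 = "AAGAAGGAGATATCATATGGCAGCGAAAGGGGACC"
PRIMER_2 = "CGCAGATCTTTAAATTACGGCAGGACTTTAGCGGAAA"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def forward_overlap(primer, template):
    """Longest 3' end of the primer that is identical to the template 5' end."""
    for k in range(min(len(primer), len(template)), 0, -1):
        if template.startswith(primer[-k:]):
            return k
    return 0


def reverse_overlap(primer, template):
    """Longest 3' end of the primer that base-pairs with the template 3' end."""
    rc = reverse_complement(primer)
    for k in range(min(len(rc), len(template)), 0, -1):
        if template.endswith(rc[:k]):
            return k
    return 0


fwd = forward_overlap(PRIMER_1, TEMPLATE)
rev = reverse_overlap(PRIMER_2, TEMPLATE)
# Each primer adds its non-annealing 5' tail to the amplified product.
amplicon_length = len(TEMPLATE) + (len(PRIMER_1) - fwd) + (len(PRIMER_2) - rev)
print(len(TEMPLATE), fwd, rev, amplicon_length)
```

With the sequences as printed, primer 1 shares its last 19 bases with the 5' end of the oligonucleotide and primer 2 pairs with the last 21 bases of its 3' end, so each primer carries a 16-base 5' tail into the product.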

The DNA was resuspended and digested with BanI REN as described in
Example 1. The digested DNA was purified as described in Example 1, and then
ligated with pPT0285 previously digested with BanI, treated with SAP, and purified
as described in Example 1. The product of the ligation reaction was transformed
into E. coli strain HB101. Plasmid DNA from transformants was purified and
analyzed as described below. Colonies were picked and transferred onto a plate and
into a 0.5 ml microfuge tube containing 50 µl of lysis buffer (1% Tween 20, 10 mM
Tris-HCl pH 8.0, 1 mM EDTA). The tube was closed, incubated at 95°C for 10
minutes, and then cooled to room temperature. 5 µl of lysate was added to 45 µl
MasterMix (1x PCR buffer as described previously, 5 U Amplitaq, 200 µM dNTPs)
in a 0.5 ml Perkin Elmer thin-walled GeneAmp™ reaction tube. Amplification was
performed in a Perkin Elmer DNA Thermal Cycler model 480 for 30 cycles with the
following step cycle of 1 minute each: 95°C, 52°C, and 72°C. Aliquots from
different reactions were analyzed by agarose gel electrophoresis using 1.5% Low
Melting Point agarose in 0.5x TAE buffer. Plasmid DNA from the clones showing
the correct size insert was purified and analyzed by DNA sequencing. Plasmid
pPT0310 contained the desired CLP 3.7 monomer sequence (see Table 6).

Table 6

BanI AvaI/SmaI
5'- GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA
3'- CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT
    GAPGTPGPQGLPGSP

BanII GsuI StuI DraIII
GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG
CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC
GAPGTPGPQGLPGSP

BglI BamHI
GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA
CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT
GAPGTPGPQGLPGSP

EcoO109I BanI
GGTGCACCAGGAACGCCGGGACCTCAGGGTCTTCCGGGTAGCCCTGGTGCC - 3'
CCACGTGGTCCTTGCGGCCCTGGAGTCCCAGAAGGCCCATCGGGACCACGG - 5'
GAPGTPGPQGLPGSP (G A)



CLP 3.7 Polymer Construction:

Plasmid DNA from pPT0310 was digested with BanI REN and the digestion
fragments were separated by agarose gel electrophoresis. The CLP 3.7 gene
fragment, 180 bp, was excised and purified by NACS column (see Methods). The
purified fragment was ligated with plasmid pSY1262 which had been prepared as
follows: pSY1262 plasmid DNA was digested with BanI REN and subsequently
treated with Shrimp Alkaline Phosphatase (SAP) as described in Example 1.
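
The 180 bp fragment size can be reproduced from the Table 6 sequence. The sketch below is illustrative only: it assumes that BanI recognizes GGYRCC and cuts after the first G (the recognition sequence is not stated in this text), and the helper names are hypothetical. It locates the BanI sites on the top strand, measures the fragment between the two cuts, and translates the monomer with the standard genetic code.

```python
import re

# Top strand of the CLP 3.7 monomer as printed in Table 6 (5' to 3').
MONOMER = (
    "GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA"
    "GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG"
    "GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA"
    "GGTGCACCAGGAACGCCGGGACCTCAGGGTCTTCCGGGTAGCCCTGGTGCC"
)

# Assumption: BanI site is G^GYRCC (Y = C or T, R = A or G).
BANI_SITE = re.compile(r"(?=GG[CT][AG]CC)")
BANI_CUT_OFFSET = 1

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cut_positions(seq):
    """Top-strand cut positions for every BanI site in seq."""
    return [m.start() + BANI_CUT_OFFSET for m in BANI_SITE.finditer(seq)]


def fragments(seq):
    """Fragments lying between consecutive cut positions."""
    cuts = cut_positions(seq)
    return [seq[a:b] for a, b in zip(cuts, cuts[1:])]


def translate(seq):
    """Translate complete codons of seq, reading from its first base."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


monomer_fragment = fragments(MONOMER)[0]
print(len(monomer_fragment), translate(MONOMER[:180]))
```

Because the two BanI sites flank the monomer and no site occurs inside it, ligating such fragments head to tail preserves the reading frame across each junction.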

The product of this ligation reaction was transformed into E. coli strain
HB101. Transformants were selected for resistance to kanamycin. Plasmid DNA
from individual transformants was purified and analyzed for increased size due to
CLP 3.7 multiple DNA insertion. Several clones were obtained and two of them
containing inserts of approximately 1.25 kbp and 2.6 kbp (pPT0314 and pPT0312
respectively) were chosen to be used for expression of CLP 3.7.

CLP 3.7 Analysis:

E. coli strain HB101 containing plasmid pPT0312 or pPT0314 was grown
as described in Example 1. The proteins produced by these cells were analyzed by
SDS-PAGE for detection of reactivity to CLP antibodies. In every analysis a strong
reactive band was observed with an apparent molecular weight of 130 kD and 50 kD,
respectively.



pPT0312 CLP 3.7 837 AA MW 72,637

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAPGTPGPQGLPGSP)4]13
GAMDPGRYQLSAGRYHYQLVWCQK

pPT0314 CLP 3.7 417 AA MW 37,060

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAPGTPGPQGLPGSP)4]6
GAMDPGRYQLSAGRYHYQLVWCQK
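
The two listings above can be cross-checked against each other. Since both polymers share the same flanking sequences, the difference in total length divided by the difference in repeat number gives the size of one repeat. The following is a hedged sketch of that arithmetic; the function name is hypothetical and only the totals and repeat counts printed above are used.

```python
def residues_per_repeat(total_a, repeats_a, total_b, repeats_b):
    """Residues contributed by one repeat; shared flanking residues cancel out."""
    delta_residues = total_a - total_b
    delta_repeats = repeats_a - repeats_b
    if delta_residues % delta_repeats:
        raise ValueError("totals are not consistent with a whole-residue repeat")
    return delta_residues // delta_repeats


# pPT0312: 837 residues, 13 repeats; pPT0314: 417 residues, 6 repeats.
per_repeat = residues_per_repeat(837, 13, 417, 6)
repeat_bp = per_repeat * 3                 # coding length of one repeat
flanking = 837 - 13 * per_repeat           # residues outside the repeats
print(per_repeat, repeat_bp, flanking)
```

The result, 60 residues or 180 bp per repeat, agrees with four copies of the 15 amino acid block and with the 180 bp BanI fragment used for polymer construction.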

PPAS1-A Polymer Construction

The Protein Polymer Adhesive Substrate (PPAS) polymer was designed to
include a 17 amino acid oligopeptide block of human fibrin gamma chain within a
structural backbone consisting of 3 complete repeats of a 15 amino acid peptide
block of human collagen type I (GAPGTPGPQGLPGSP, the CLP3.7 monomer
repeating amino acid sequence).

PPAS1-A Gene Monomer Synthesis and Construction

The PPAS1-A amino acid monomer sequence with the fibrin gamma
sequence shown in bold is as follows:

(GAPGTPGPQGLPGSP)3 GAPGTPGEGQQHHLGGAKQAGDVGSP


One oligonucleotide strand (see Table 7) was synthesized using an Applied
Biosystems DNA synthesizer model 381A and a 2000 Å pore resin synthesis column
supplied by Glen Research. During the synthesis, the required interrupt-pause steps
for reagent bottle changes were minimized. After the synthesis, the 123 base DNA
fragment was deprotected and cleaved from the column support by treatment in
ammonium hydroxide at 55°C for 6 hrs.

Table 7

5'- ATGGCAGCGAAAGGGGACCGTGCACCAGGAACGCCGGGAGAAGGTCAACAGCACCATCTTGG 
TGGAGCGAAACAGGCAGGCGACGTCGGTAGCCCTGGTGCCTTTCCGCTAAAGTCCTGCCGT -3 ' 



The PCR reaction was then performed as previously described using the
same primers as were used in the construction of the CLP3.7 monomer. The
amplified DNA was then resuspended and digested with ApaLI and DraI RENs. The
digested DNA was then purified using a Probind filter followed by a Bio-Spin
column and then ligated with pPT0310 previously digested with ApaLI and EcoRV
RENs and purified by NACS column. The products of the ligation reaction were
transformed into E. coli strain HB101. Plasmid DNA from transformants was
purified and analyzed by digestion using EcoO109I, HincII and HindIII RENs.
Plasmid DNA from the clones showing the correct size insert was purified and
analyzed by DNA sequencing. Plasmid pPT0318 contained the desired PPAS1-A
gene monomer sequence (see Table 8).

Table 8

BanI AvaI/SmaI
GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA
CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT
GAPGTPGPQGLPGSP

BanII GsuI StuI DraIII
GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG
CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC
GAPGTPGPQGLPGSP

BglI BamHI
GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA
CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT
GAPGTPGPQGLPGSP

GGTGCACCAGGAACGCCGGGAGAAGGTCAACAGCACCATCTTGGT
CCACGTGGTCCTTGCGGCCCTCTTCCAGTTGTCGTGGTAGAACCA
GAPGTPGEGQQHHLG

AatII BanI
GGAGCGAAACAGGCAGGCGACGTCGGTAGCCCTGGTGCC - 3'
CCTCGCTTTGTCCGTCCGCTGCAGCCATCGGGACCACGG - 5'
GAKQAGDVGSP (G A)



Construction of Expression Plasmid pPT0317

Plasmid DNA pSY1262 was linearized with PvuII REN, then passed through
a Probind filter followed by a Bio-Spin column. The DNA was then treated with
SAP and ligated with a DNA fragment from pQE-17 (QIAGEN Catalog #33173)
prepared as follows. Plasmid DNA pQE-17 was digested with BglII and HindIII
RENs and the 36 bp fragment (see Table 9) was purified using a Probind filter and
then a Bio-Spin column. The DNA was purified further using a Microcon-30 filter
and the filtrate containing the 36 bp fragment was kept. The DNA was then treated with DNA
Polymerase I and purified through a Probind filter and then a Bio-Spin column.

Table 9

5'- GATCTTCGATCTCATCACCATCACCATCACTA

3'- AAGCTAGAGTAGTGGTAGTGGTAGTGATTCGA

The products of the ligation reaction were transformed into E. coli strain
HB101. Plasmid DNA from transformants was purified and analyzed by digestion
using BstYI and Bst1107I RENs. Plasmid DNA from the clones showing the correct
restriction pattern was purified and analyzed by DNA sequencing. Plasmid pPT0317
contained the desired DNA insert and was used for further DNA manipulations.

PPAS1-A Polymer Construction

Plasmid DNA from pPT0318 was digested with BanI REN and the digestion
fragments were separated by agarose gel electrophoresis. The PPAS1-A gene
fragment, 216 bp, was excised and purified using the Ultrafree-MC filter. The
purified fragment was ligated with plasmid pPT0317 which had been prepared as
follows. Plasmid DNA pPT0317 was digested with BanI REN, then passed through
a Probind filter and then a Bio-Spin column. The DNA was then treated with SAP.

The products of the ligation reaction were transformed into E. coli strain
HB101. Transformants were selected for resistance to kanamycin. Plasmid DNA
from individual transformants was purified and analyzed using EcoRI and EcoRV
RENs for the presence of PPAS1-A multimer gene inserts. Several clones were
obtained with insert sizes ranging from 200 bp to approximately 4 kb. Several clones
containing from 10 to 20 repeats were chosen for use in expression of the PPAS1-A
polymer.

PPAS1-A Expression Analysis

E. coli strain HB101 containing plasmid pPT0321, pPT0325, pPT0326, or
pPT0327 was cultured as previously described. The proteins produced by these cells
showed strong reactive bands of apparent molecular weights ranging from 80 kD to
180 kD when analyzed by western blot for reactivity to CLP antibody. One clone,
pPT0321, containing 10 repeats of the PPAS1-A monomer was selected for further
study.

pPT0321 PPAS1-A 762 AA MW 68,056

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAPGTPGPQGLPGSP)3 GAPGTPGEGQQHHLGGAKQAGDVGSP]10
GAMDPGRYQDLRSHHHHHH

EXAMPLE 7

Production of SELP8K and SELP8E

Polymers designated SELP8K and SELP8E were prepared, which are
characterized by having specific chemically reactive functional groups within the
elastin-like block. The construction of these polymers is described below starting
from the previous gene monomer, SELP0 (see U.S. Patent #5,243,038, pSY1298,
where SELP0 is termed EBSI).

SELP8K and SELP8E amino acid monomer sequence design:
SELP8K MONOMER (GAGAGS)4 (GVGVP)4 GKGVP (GVGVP)3

SELP8E MONOMER (GAGAGS)4 (GVGVP)4 GEGVP (GVGVP)3
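
The block notation used for the monomer designs can be expanded mechanically. The sketch below is illustrative; the function name is hypothetical. It builds both monomers from their blocks, reports the monomer length in residues and the corresponding coding length in base pairs, and lists where the two designs differ.

```python
SILK_BLOCK = "GAGAGS"
ELASTIN_BLOCK = "GVGVP"


def build_monomer(substituted_block):
    """Expand (GAGAGS)4 (GVGVP)4 <substituted block> (GVGVP)3 into one string."""
    return SILK_BLOCK * 4 + ELASTIN_BLOCK * 4 + substituted_block + ELASTIN_BLOCK * 3


selp8k = build_monomer("GKGVP")
selp8e = build_monomer("GEGVP")

# Positions (0-based) at which the two monomers differ.
differences = [(i, k, e) for i, (k, e) in enumerate(zip(selp8k, selp8e)) if k != e]

print(len(selp8k), len(selp8k) * 3, differences)
```

Each monomer is 64 residues long, corresponding to 192 bp of coding sequence, and the two designs differ at a single residue within the fifth elastin-like block, where lysine is replaced by glutamic acid.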

Construction of SELP8 Gene Monomer

Plasmid pSY1378 (see US Patent #5,243,038) was digested with BanI REN,
purified using agarose gel electrophoresis followed by NACS column, and the DNA
was then ethanol precipitated in 2.5 M ammonium acetate and ligated with pPT0134
(see PCT/US92/09485) previously digested with FokI REN, phenol/chloroform
extracted and ethanol precipitated.

The products of the ligation mixture were transformed into E. coli strain
HB101. Plasmid DNA from transformants was purified and analyzed by digestion
using NruI and XmnI RENs. Plasmid pPT0255 containing the desired restriction
pattern was obtained and was used for subsequent constructions.

Plasmid DNA pPT0255 was treated with Cfr10I REN followed by RNAse.
The digestion fragments were separated by agarose gel electrophoresis, the DNA
was excised and self-ligated. The products of the ligation mixture were transformed
into E. coli strain HB101. Plasmid DNA from transformants was purified and
analyzed by digestion using NaeI and StuI RENs. Plasmid pPT0267 containing the
desired deletion was used for subsequent constructions.

Two oligonucleotide strands as shown in Table 10 were synthesized and
purified as described in Example 1.

Table 10

5'- CTGGAGCGGGTGCCTGCATGTACATCCGAGT -3'

3'- CCGAGACCTCGCCCACGGACGTACATGTAGGCTCA -5'

The two oligonucleotide strands were annealed and ligated with the DNA of plasmid
pPT0267 which had been previously digested with BanII and ScaI RENs, and
purified by agarose gel electrophoresis followed by NACS column.

The products of this ligation reaction were transformed into E. coli strain
HB101. Plasmid DNA from transformants was purified and digested with DraI.
Plasmid DNA from two clones that gave the correct digestion pattern was
sequenced. One plasmid DNA, designated pPT0287, was found to be correct and
chosen for further constructions.

Plasmid DNA pSY1298 (see U.S. Patent #5,243,038) was digested with
BanII REN, and the SELP0 gene fragment was purified by agarose gel
electrophoresis followed by NACS and then ligated to pPT0287 digested with BanII.
The enzyme was then removed using phenol/chloroform extraction and ethanol
precipitation.

The products of the ligation mixture were transformed into E. coli strain
HB101. Plasmid DNA from transformants was purified and analyzed by digestion
using DraI REN. Plasmid DNA from the clones showing the correct restriction
pattern was further digested with BanII, AhaII and StuI RENs. Plasmid pPT0289
contained the desired SELP8 monomer sequence (see Table 11).

Table 11

BanI BanII
GGT GCC GGT TCT GGA GCT GGC GCG GGC TCT GGA GTA GGT GTG CCA GGT
CCA CGG CCA AGA CCT CGA CCG CGC CCG AGA CCT CAT CCA CAC GGT CCA
GAGSGAGAGSGVGVPG

GTA GGA GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA GTG
CAT CCT CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CCT CAC
VGVPGVGVPGVGVPGV

SmaI
GGT GTT CCA GGC GTA GGT GTG CCC GGG GTA GGA GTA CCA GGG GTA GGC
CCA CAA GGT CCG CAT CCA CAC GGG CCC CAT CCT CAT GGT CCC CAT CCG
GVPGVGVPGVGVPGVG

BanII
GTC CCT GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT GGA GCG
CAG GGA CCT CGC CCA CGA CCA TCG CCG CGT CCG CGC CCG AGA CCT CGC
VPGAGAGSGAGAGSGA


Construction of SELP8K and SELP8E Gene Monomers

One oligonucleotide strand coding for a portion of the SELP8 gene monomer
was synthesized with a single base polymorphism at position 90. The use of both
adenine and guanine at this position produced oligonucleotides from a single
synthesis that encoded the amino acids lysine and glutamic acid (see Table 12). The
synthesis was conducted using an Applied Biosystems DNA synthesizer model 381A
and a 2000 Å synthesis column supplied by Glen Research. During the synthesis the
required interrupt-pauses for bottle changes were minimized. After the synthesis the
202 base DNA fragment was deprotected and cleaved from the column support by
treatment in 30% ammonium hydroxide at 55°C for 6 hrs.

Table 12

5'- ATGGCAGCGAAAGGGGACCGGGCTCTGGTGTTGGAGTGCCAGGTGTCGGTGTTCCGGGTGTAGG
CGTTCCGGGAGTTGGTGTACCTGGA(A/G)AAGGTGTTCCGGGGGTAGGTGTGCCGGGCGTTGGAGTA
CCAGGTGTAGGCGTCCCGGGAGCGGGTGCTGGTAGCGGCGCAGGCGCGGGCTCTTTCCGCTAAAGTCC
TGCCGT -3'

Two additional DNA strands were used as primers for PCR amplification. The two
strands were:

1. 5'- AAGAAGGAGATATCATATGGCAGCGAAAGGGGACC -3'

2. 5'- CGCAGATCTTTAAATTACGGCAGGACTTTAGCGGAAA -3'

The PCR reaction was carried out and the reaction product was purified as 
described in Example 1. 
The DNA was resuspended and digested with BanII REN as described in
Example 1. The digested DNA was then separated by low-melting agarose gel
electrophoresis and ligated with pPT0289 previously digested with BanII REN and
purified by NACS column. The products of the ligation reaction were transformed
into E. coli strain HB101. Plasmid DNA from isolated transformants was purified
and analyzed by digestion using ApaLI and EcoNI RENs. Plasmid DNA from the
clones showing the correct restriction pattern was further analyzed by digestion
using Asp700 REN to distinguish between clones encoding a lysine or glutamic acid
at the polymorphic position. Plasmid DNA from clones containing each of the
polymorphs was purified and analyzed by DNA sequencing. Plasmid pPT0340
contained the desired SELP8K monomer sequence and pPT0350 contained the
desired SELP8E monomer sequence (see Tables 13 and 14, respectively).
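
The effect of the mixed base, and the reason an Asp700 digest can tell the two products apart, can be illustrated in a few lines. This is a sketch under stated assumptions: the Asp700 recognition sequence is taken to be GAANNNNTTC, which is not stated in this text, and the variable and function names are hypothetical. Only the region of the Table 12 oligonucleotide surrounding the mixed base is used.

```python
import re

# Bases flanking the mixed position in the Table 12 oligonucleotide.
UPSTREAM = "CGTTCCGGGAGTTGGTGTACCTGGA"
DOWNSTREAM = "AAGGTGTTCCGGGGGTAGGTGTGCCGGGCGTTGGAGTA"

# Assumption: Asp700 recognizes GAANNNNTTC.
ASP700_SITE = re.compile(r"GAA[ACGT]{4}TTC")

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def describe_variant(mixed_base):
    """Codon, encoded residue, and Asp700 site presence for one variant."""
    seq = UPSTREAM + mixed_base + DOWNSTREAM
    codon = seq[len(UPSTREAM):len(UPSTREAM) + 3]
    has_site = ASP700_SITE.search(seq) is not None
    return codon, CODON_TABLE[codon], has_site


results = {base: describe_variant(base) for base in "AG"}
for base, (codon, residue, has_site) in results.items():
    print(base, codon, residue, "Asp700 site" if has_site else "no Asp700 site")
```

Under these assumptions the adenine variant reads AAA (lysine) and lacks the site, while the guanine variant reads GAA (glutamic acid) and creates it, which is consistent with the Asp700 annotation that appears only in the SELP8E table.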



Table 13



BanI BanII

GGT GCC GGT TCT GGA GCT GGC GCG GGC TCT GGT GTT GGA GTG CCA GGT 
CCA CGG CCA AGA CCT CGA CCG CGC CCG AGA CCA CAA CCT CAC GGT CCA 
GAGSGAGAGSGVGVPG

EcoNI 

GTC GGT GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA AAA 
CAG CCA CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CCT TTT 
VGVPGVGVPGVGVPGK 



GGT GTT CCG GGG GTA GGT GTG CCG GGC GTT GGA GTA CCA GGT GTA GGC 
CCA CAA GGC CCC CAT CCA CAC GGC CCG CAA CCT CAT GGT CCA CAT CCG 
GVPGVGVPGVGVPGVG 

SmaI BanII

GTC CCG GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT GGA GCG
CAG GGC CCT CGC CCA CGA CCA TCG CCG CGT CCG CGC CCG AGA CCT CGC
VPGAGAGSGAGAGSGA



Table 14

BanI

GGT GCC GGT TCT GGA GCT GGC GCG
CCA CGG CCA AGA CCT CGA CCG CGC
GAGSGAGA

BanII

GGC TCT GGT GTT GGA GTG CCA GGT 
CCG AGA CCA CAA CCT CAC GGT CCA 
GSGVGVPG 



GTC GGT GTT CCG GGT GTA GGC GTT 
CAG CCA CAA GGC CCA CAT CCG CAA 
VGVPGVGV 



EcoNI 

CCG GGA GTT GGT GTA CCT GGA GAA 
GGC CCT CAA CCA CAT GGA CCT CTT 
PGVGVPGE



Asp700 

GGT GTT CCG GGG GTA GGT GTG CCG 
CCA CAA GGC CCC CAT CCA CAC GGC 
GVPGVGVP 



GGC GTT GGA GTA CCA GGT GTA GGC 
CCG CAA CCT CAT GGT CCA CAT CCG 
GVGVPGVG 
SmaI

GTC CCG GGA GCG GGT GCT GGT AGC 
CAG GGC CCT CGC CCA CGA CCA TCG 
VPGAGAGS 



BanII

GGC GCA GGC GCG GGC TCT GGA GCG 
CCG CGT CCG CGC CCG AGA CCT CGC 
GAGAGSGA 



SELP8K Polymer Construction 

Plasmid DNA from pPT0340 was digested with BanI REN and the digestion
fragments were separated by agarose gel electrophoresis. The SELP8K gene
fragment, 192 bp, was excised and purified by NACS column. The purified
fragment was ligated with plasmid pPT0317 which had been digested with BanI
REN, passed through a Millipore Probind and a Bio-Spin 6 column. The DNA was
then treated with shrimp alkaline phosphatase (SAP) as described in Example 1.

The products of this ligation reaction were transformed into E. coli strain
HB101. Transformants were selected for resistance to kanamycin. Plasmid DNA
from individual transformants was purified and analyzed for increased size due to
SELP8K monomer multiple DNA insertion. Several clones were obtained with insert
sizes ranging from 200 bp to approximately 7 kb. Clones containing from 6 to 32
repeats were used for expression of the SELP8K protein polymer (pPT0341,
pPT0343, pPT0344, pPT0345 and pPT0347).

SELP8K Expression Analysis 

E. coli strain HB101 containing plasmids pPT0341, pPT0343, pPT0344,
pPT0345 and pPT0347 were grown as described in Example 1. The proteins
produced by these cells were analyzed by Western blot for detection of proteins
reactive to SLP antibodies. Each clone produced a strongly reactive band. The
apparent molecular weights of the products ranged from approximately 35 kD to
greater than 250 kD. Strain pPT0345 produced an SLP antibody reactive band of
apparent molecular weight 80,000. The expected amino acid sequence of the
SELP8K polymer encoded by plasmid pPT0345 is shown below.

pPT0345 SELP8K 884 AA MW 69,772

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS
[(GVGVP)4 GKGVP (GVGVP)3 (GAGAGS)4]12
(GVGVP)4 GKGVP (GVGVP)3 (GAGAGS)2
GAGAMDPGRYQDLRSHHHHHH
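
The stated length of 884 amino acids provides a check on the block structure of the pPT0345 product. The following sketch assembles the sequence under explicit assumptions, since the subscripts and part of the leader are damaged in the scanned listing: the leader is taken to begin MDPVV (33 residues), the bracketed unit is taken as (GVGVP)4 GKGVP (GVGVP)3 (GAGAGS)4 repeated 12 times, and the final partial unit is taken to end in (GAGAGS)2. The function and constant names are hypothetical.

```python
# Assumed reading of the pPT0345 listing (see the assumptions stated above).
HEAD = "MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM"
LINKER = "GAGSGAGAGS"
ELASTIN_SEGMENT = "GVGVP" * 4 + "GKGVP" + "GVGVP" * 3
SILK_BLOCK = "GAGAGS"
TAIL = "GAGAMDPGRYQDLRSHHHHHH"


def selp8k_polymer(full_repeats=12, final_silk_blocks=2):
    """Assemble the polymer from leader, repeats, final partial unit, and tail."""
    repeat = ELASTIN_SEGMENT + SILK_BLOCK * 4
    final_unit = ELASTIN_SEGMENT + SILK_BLOCK * final_silk_blocks
    return HEAD + LINKER + repeat * full_repeats + final_unit + TAIL


protein = selp8k_polymer()
print(len(protein), protein.count("GKGVP"))
```

Under these assumptions the assembled sequence has exactly 884 residues and contains 13 lysine-bearing elastin blocks, which is the basis for reading the damaged subscripts as 4, 3, 4, 12 and 2.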

Construction of SELP0K Polymers

The copolymer structure of SELP8K consists of silk-like blocks (SLP block)
and elastin-like blocks (ELP block) in the following sequence: [(SLP block)4 (ELP
block)8]. Additional polymers were designed to have different resorption and
solution properties by adjusting their silk-like to elastin-like block lengths while
maintaining their reactive properties. SELP0K contains crystallizable silk-like
blocks half the length of those in SELP8K while maintaining the dispersion
frequency with respect to the elastin-like blocks.
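
The difference between the two designs can be expressed as the fraction of each monomer occupied by silk-like residues. The sketch below is illustrative; the SELP0K monomer is written as (GAGAGS)2 (GVGVP)4 GKGVP (GVGVP)3, which is the repeat unit appearing in the SELP0K polymer listings of this example, and the function name is hypothetical.

```python
SILK_BLOCK = "GAGAGS"
ELASTIN_SEGMENT = "GVGVP" * 4 + "GKGVP" + "GVGVP" * 3   # eight elastin-like blocks

MONOMERS = {
    "SELP8K": SILK_BLOCK * 4 + ELASTIN_SEGMENT,
    "SELP0K": SILK_BLOCK * 2 + ELASTIN_SEGMENT,
}


def silk_fraction(monomer):
    """Fraction of monomer residues that belong to silk-like blocks."""
    silk_residues = monomer.count(SILK_BLOCK) * len(SILK_BLOCK)
    return silk_residues / len(monomer)


for name, monomer in MONOMERS.items():
    print(name, len(monomer), round(silk_fraction(monomer), 3))
```

Both monomers keep the same eight elastin-like blocks, including the single lysine-containing block, so halving the silk-like segment lowers the silk content from 24 of 64 residues to 12 of 52 without changing how often the reactive residue occurs per repeat.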

A polymer with intervening sequences to promote in vivo resorption through
proteolytic cleavage by collagenase (92 kD) was also designed. SELP0K-CS1
contains two adjacent cleavage sites for collagenase (PLGP) within a six amino acid
insert (GAGAGS GVGVP LGPLGP GVGVP).

Construction of Plasmid pPT0317

Plasmid DNA pSY1262 (see U.S. Patent No. 5,243,038) was linearized with
PvuII REN, then passed through a Probind filter and a Bio-Spin 6 column. The
DNA was then treated with Shrimp Alkaline Phosphatase (SAP). The linearized
pSY1262 DNA was then ligated with a DNA fragment from pQE-17 (QIAGEN
Catalog #33173) prepared as follows. Plasmid DNA pQE-17 was digested with
BglII and HindIII RENs and the 36 bp fragment shown in Table 15 was purified
using a Probind filter and a Bio-Spin column. The DNA was purified further using a
Microcon-30 filter and the filtrate, containing the 36 bp fragment, was kept. The
DNA was then treated with DNA Polymerase I and purified using a Probind filter
and a Bio-Spin column (see Example 1).

Table 15

5'- GATCTTCGATCTCATCACCATCACCATCACTA

3'- AAGCTAGAGTAGTGGTAGTGGTAGTGATTCGA

The product of the ligation reaction was transformed into E. coli strain
HB101. Plasmid DNA from transformants was purified and analyzed by digestion
using Bst1107I and EcoRV RENs. The clones containing the desired DNA fragment
were further digested with Bst1107I and BstYI RENs to determine the orientation of
the insert. Plasmid DNA from the clones showing the correct restriction pattern was
purified and analyzed by DNA sequencing. Plasmid pPT0317 contained the desired
DNA insert and was used for further DNA constructions.

SELP0K Polymer Construction

One oligonucleotide strand as shown in Table 16 was synthesized using an
Applied Biosystems DNA synthesizer model 381A and a 2000 Å synthesis column
supplied by Glen Research. After the synthesis the 93 base DNA fragment was
deprotected and cleaved from the column support by treatment in ammonium
hydroxide at 55°C for 6 hours.

Table 16

5'- ATGGCAGCGAAAGGGGACCGGTGCCGGCGCAGGTAGCGGAGCCGGTGCGGGCTCAAAAAGGG
CTCTGGTGCCTTTCCGCTAAAGTCCTGCCGT -3'

The PCR reaction was performed using the same two DNA primer strands as
described for the construction of the SELP8K gene monomer and the reaction
product was purified. The DNA was resuspended and digested with BanI REN.
The digested DNA was then separated by low-melting agarose gel and ligated with
pPT0285 (see PCT/US92/09485) previously digested with BanI REN and purified
by NACS column. The product of the ligation reaction was transformed into E. coli
strain HB101. Plasmid DNA from transformants was purified and analyzed by
digestion using EcoRI and BanII RENs. Plasmid DNA from the clones showing the
correct restriction pattern was then purified and analyzed by DNA sequencing.
Plasmid pPT0358 contained the desired sequence and was used for subsequent DNA
constructions.

Plasmid DNA from pPT0340 was digested with BanII REN and the digestion
fragments were separated by agarose gel electrophoresis. The SELP0K gene
fragment, 156 bp (see Table 17), was excised and purified using an Ultrafree-MC
filter followed by Bio-Spin 6 column.

Table 17 

BanII
G GGC TCT GGT GTT GGA GTG CCA GGT GTC GGT GTT CCG GGT GTA GGC GTT
C CCG AGA CCA CAA CCT CAC GGT CCA CAG CCA CAA GGC CCA CAT CCG CAA
GSGVGVPGVGVPGVGV

CCG GGA GTT GGT GTA CCT GGA AAA GGT GTT CCG GGG GTA GGT GTG CCG
GGC CCT CAA CCA CAT GGA CCT TTT CCA CAA GGC CCC CAT CCA CAC GGC
PGVGVPGKGVPGVGVP

GGC GTT GGA GTA CCA GGT GTA GGC GTC CCG GGA GCG GGT GCT GGT AGC
CCG CAA CCT CAT GGT CCA CAT CCG CAG GGC CCT CGC CCA CGA CCA TCG
GVGVPGVGVPGAGAGS

BanII
GGC GCA GGC GCG GGC TC
CCG CGT CCG CGC CCG AG
G A G A G S
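
The 156 bp size of the SELP0K fragment follows from the positions of the two BanII sites in the SELP8K monomer. The sketch below is illustrative only: it assumes that BanII recognizes GRGCYC and cuts before the final C (the recognition sequence is not stated in this text), it uses the four full rows of the SELP8K monomer top strand from Table 13, and the helper names are hypothetical.

```python
import re

# Top strand of the SELP8K gene monomer (four rows of Table 13, 5' to 3').
SELP8K_MONOMER = (
    "GGTGCCGGTTCTGGAGCTGGCGCGGGCTCTGGTGTTGGAGTGCCAGGT"
    "GTCGGTGTTCCGGGTGTAGGCGTTCCGGGAGTTGGTGTACCTGGAAAA"
    "GGTGTTCCGGGGGTAGGTGTGCCGGGCGTTGGAGTACCAGGTGTAGGC"
    "GTCCCGGGAGCGGGTGCTGGTAGCGGCGCAGGCGCGGGCTCTGGAGCG"
)

# Assumption: BanII site is GRGCY^C (R = A or G, Y = C or T).
BANII_SITE = re.compile(r"(?=G[AG]GC[CT]C)")
BANII_CUT_OFFSET = 5


def site_starts(seq):
    """Start index of every BanII recognition site on the top strand."""
    return [m.start() for m in BANII_SITE.finditer(seq)]


def excised_fragment(seq):
    """Top-strand sequence between the first and last BanII cuts."""
    starts = site_starts(seq)
    return seq[starts[0] + BANII_CUT_OFFSET:starts[-1] + BANII_CUT_OFFSET]


fragment = excised_fragment(SELP8K_MONOMER)
print(site_starts(SELP8K_MONOMER), len(fragment), len(fragment) // 3)
```

The two sites lie within the silk-like segments at either end of the monomer, so the excised fragment retains the complete elastin-like segment together with two of the four silk-like blocks, giving a 52 residue repeat.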


The purified fragment was ligated with plasmid pPT0358 which had been
digested with BanII REN, then passed through a Probind filter and a Microcon-30
filter. The digestion fragments were then separated by agarose gel electrophoresis.
The plasmid DNA was then excised and purified using an Ultrafree-MC filter
followed by Bio-Spin 6 column (see Example 1).

The product of this ligation reaction was transformed into E. coli strain
HB101. Transformants were selected for resistance to chloramphenicol. Plasmid
DNA from individual transformants was purified and analyzed for increased size due
to SELP0K multiple DNA insertion. Several clones were obtained with inserts of
different sizes. Plasmids pPT0359, pPT0360 and pPT0374, containing respectively
18, 2 and 6 repeats of the SELP0K gene monomer, were used for subsequent
constructions.

Plasmid DNA from pPT0359 and pPT0374 was digested with BanI REN and
the digestion fragments were separated by agarose gel electrophoresis. The SELP0K
gene fragments, approximately 2800 bp and 1000 bp, were excised and purified by
NACS column. The purified fragments were then ligated with plasmid pPT0317
which had been digested with BanI REN, then passed through a Probind filter and a
Bio-Spin 6 column. The DNA was then treated with Shrimp Alkaline Phosphatase
(SAP), passed through a Probind filter and then a Bio-Spin 6 column (see Example
1).

The product of these ligation reactions was transformed into E. coli strain
HB101. Transformants were selected for resistance to kanamycin. Plasmid DNA
from individual transformants was purified and analyzed for increased size due to
SELP0K multiple DNA insertion. Several clones were obtained. Plasmids pPT0364
and pPT0375 were chosen to be used for expression of SELP0K.

SELP0K Expression Analysis

E. coli strain HB101 containing plasmid pPT0364 and pPT0375 were grown
as described in Example 1. The proteins produced by these cells were analyzed by
SDS-PAGE for detection of reactivity to ELP antibodies. In every analysis a strong
reactive band was observed of an apparent molecular weight of approximately 95 kD
and 35 kD, respectively.

pPT0364 SELP0K 1000 AA MW 80,684

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAGAGS)2 (GVGVP)4 GKGVP (GVGVP)3]18
(GAGAGS)2 GAGAMDPGRYQDLRSHHHHHH

pPT0375 SELP0K 376 AA MW 31,445

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAGAGS)2 (GVGVP)4 GKGVP (GVGVP)3]6
(GAGAGS)2 GAGAMDPGRYQDLRSHHHHHH

SELP0K-CS1 Polymer Construction

Plasmid pPT0360 was digested with BanI REN and the digestion fragments
were separated by agarose gel electrophoresis. The SELP0K gene fragment,
approximately 300 bp, was excised and purified using an Ultrafree-MC filter
followed by Bio-Spin 6 column. The purified fragment was ligated with plasmid
pPT0134 (see PCT/US92/09485) which had been digested with FokI REN. The
enzyme was heat inactivated at 65°C for 20 minutes and the ligation mixture was
then passed through a Probind filter. The DNA was then treated with Shrimp
Alkaline Phosphatase (SAP), passed through a Probind filter and then a Bio-Spin 6
column.

The product of this ligation reaction was transformed into E. coli strain
HB101. Transformants were selected for resistance to chloramphenicol. Plasmid
DNA from individual transformants was purified and analyzed by digestion using
DraI REN. One plasmid, pPT0363, showed the correct restriction pattern and was
used for subsequent DNA constructions.

One oligonucleotide strand as shown in Table 18 was synthesized using an
Applied Biosystems DNA synthesizer model 381A and a 2000 Å synthesis column
supplied by Glen Research. After the synthesis the 141 base DNA fragment was
deprotected and cleaved from the column support by treatment in ammonium
hydroxide at 55°C for 6 hours.

Table 18 

5'- ATGGCAGCGAAAGGGGACCGCCGGTGCGGGCTCTGGTGTTGGAGTGCCGCTGGGTCCTCTTGG
CCCAGGTGTCGGTGTTCCGGGTGTAGGCGTTCCGGGAGTTGGTGTACCTGGAAAAGGTTTCCGCTAA
AGTCCTGCCGT -3'

The PCR reaction was performed using the same two DNA primer strands as
described for the construction of the SELP8K gene monomer and the reaction
product was purified. The DNA was then resuspended and digested with BsrFI and
EcoNI RENs. The digested DNA was treated with Probind and Microcon-30 filters,
a Bio-Spin 6 column, and then ligated with pPT0363 previously digested with BsrFI
REN, treated with a ProBind filter and a Bio-Spin 6 column and then further
digested with EcoNI REN. The digestion fragments were separated by agarose gel
electrophoresis. The larger DNA band, approximately 2000 bp, was excised and
purified using an Ultrafree-MC filter followed by Bio-Spin 6 column (see Example
1).

The product of the ligation reaction was transformed into E. coli strain
HB101. Plasmid DNA from individual transformants was purified and analyzed by
digestion using Asp700I and EcoO109I RENs. Plasmid DNA from the clones
showing the correct restriction pattern was then purified and analyzed by DNA
sequencing. Plasmid pPT0368 (see Table 19) contained the desired sequence and
was used for subsequent DNA constructions.

Table 19

BanII
G GGC TCT GGT GTT GGA GTG CCG CTG GGT CCT CTT GGC CCA GGT GTC
C CCG AGA CCA CAA CCT CAC GGC GAC CCA GGA GAA CCG GGT CCA CAG
   G   S   G   V   G   V   P   L   G   P   L   G   P   G   V

  GGT GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA AAA
  CCA CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CCT TTT
   G   V   P   G   V   G   V   P   G   V   G   V   P   G   K

  GGT GTT CCG GGG GTA GGT GTG CCG GGC GTT GGA GTA CCA GGT GTA
  CCA CAA GGC CCC CAT CCA CAC GGC CCG CAA CCT CAT GGT CCA CAT
   G   V   P   G   V   G   V   P   G   V   G   V   P   G   V

                                                    BanII
  GGC GTC CCG GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT
  CCG CAG GGC CCT CGC CCA CGA CCA TCG CCG CGT CCG CGC CCG AGA
   G   V   P   G   A   G   A   G   S   G   A   G   A   G   S
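
The three lines of each Table 19 row (coding strand, complementary strand, encoded residues) can be cross-checked programmatically. The sketch below translates the coding strand with the standard genetic code and measures the spacing between the two BanII sites. It assumes that the first character of the table is a G completing the first BanII site, and that BanII recognizes GRGCYC; the names `ROWS`, `translate` and `banii_sites` are illustrative.

```python
import re
from itertools import product

# Coding strand of Table 19, one printed row per string.
# ASSUMPTION: the table opens with a single G that precedes the first codon.
LEADING_BASE = "G"
ROWS = [
    "GGC TCT GGT GTT GGA GTG CCG CTG GGT CCT CTT GGC CCA GGT GTC",
    "GGT GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA AAA",
    "GGT GTT CCG GGG GTA GGT GTG CCG GGC GTT GGA GTA CCA GGT GTA",
    "GGC GTC CCG GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT",
]
CODING = "".join(ROWS).replace(" ", "")

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string read in frame from its first base."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def complement(dna):
    """Base-by-base complement, written in the same left-to-right order."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


protein = translate(CODING)

# ASSUMPTION: BanII consensus GRGCYC (R = A or G, Y = C or T).
full_strand = LEADING_BASE + CODING
banii_sites = [m.start() for m in re.finditer(r"(?=(G[AG]GC[CT]C))", full_strand)]
site_spacing = banii_sites[-1] - banii_sites[0]

if __name__ == "__main__":
    print(protein)
    print(complement(ROWS[0].replace(" ", "")))
    print(banii_sites, site_spacing)
```

With these inputs the translation reproduces the residue lines of the table, and the two BanII sites lie 174 bases apart, which agrees with the 174 bp monomer fragment described in the text.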

Plasmid DNA pPT0368 was digested with BanII REN, and the digestion
fragments were separated by agarose gel electrophoresis. The SELP0K-CS1 gene
fragment, 174 bp, was excised and purified using an Ultrafree-MC filter followed by
a Bio-Spin 6 column. The purified fragment was ligated with plasmid pPT0358, which
had been digested with BanII REN, then passed through a Probind filter and a
Microcon-30 filter. Subsequently the digestion fragments were separated by agarose
gel electrophoresis. The plasmid DNA was then excised and purified using an
Ultrafree-MC filter followed by a Bio-Spin 6 column (see Example 1).
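
Why a BanII-excised monomer tends to ligate head-to-tail can be illustrated with a small check on the overhang it leaves. The sketch assumes that BanII cuts GRGCY^C and so leaves a four-base 3' overhang (RGCY); this cut position is not stated in the specification and is an assumption here, as are the function names. An overhang that is not its own reverse complement cannot pair with a copy of itself, so two monomers can only join in the same orientation.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def is_self_complementary(overhang):
    """True if the overhang can pair with an identical overhang (palindromic)."""
    return overhang == reverse_complement(overhang)


# BanII site as it appears in Table 19.
SITE = "GGGCTC"
# ASSUMPTION: cut after the fifth base of each strand, leaving bases 2-5 unpaired.
banii_overhang = SITE[1:5]

# For contrast, a palindromic four-base overhang (hypothetical example).
palindromic_overhang = "AATT"

if __name__ == "__main__":
    for name, overhang in [("BanII", banii_overhang), ("palindromic", palindromic_overhang)]:
        print(name, overhang, reverse_complement(overhang), is_self_complementary(overhang))
```

Under this assumption the BanII overhang is asymmetric, so a flipped monomer presents an end that does not anneal, while the palindromic control would allow either orientation.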

The product of this ligation reaction was transformed into E. coli strain HB101.
Transformants were selected for resistance to chloramphenicol. Plasmid DNA from
individual transformants was purified and analyzed for increased size due to multiple
insertions of the SELP0K-CS1 DNA. Several clones were obtained with insert sizes
ranging from 1000 bp to approximately 3000 bp. Plasmid pPT0369, containing 16 repeats
of the SELP0K-CS1 gene monomer, was used for subsequent constructions.
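
The relation between repeat number and insert size follows directly from the 174 bp monomer. The following sketch, with illustrative names, converts between the two. It treats the insert as an exact multiple of the monomer and ignores any flanking vector sequence, which is a simplification.

```python
MONOMER_BP = 174  # SELP0K-CS1 monomer fragment size given in the text


def insert_size(repeats):
    """Size in bp of an insert made of the given number of monomer repeats."""
    return repeats * MONOMER_BP


def approximate_repeats(size_bp):
    """Nearest whole number of monomer repeats for an observed insert size."""
    return round(size_bp / MONOMER_BP)


if __name__ == "__main__":
    print(insert_size(16))
    print(approximate_repeats(1000), approximate_repeats(3000))
```

Sixteen repeats give 2784 bp, in line with the approximately 2800 bp fragment later excised from pPT0369, and the observed 1000 to 3000 bp range corresponds to roughly 6 to 17 repeats.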

Plasmid DNA from pPT0369 was digested with BanI REN, passed through a
Probind filter, and then the digestion fragments were separated by agarose gel
electrophoresis. The SELP0K-CS1 gene fragment, approximately 2800 bp, was excised
and purified using an Ultrafree-MC filter and desalted using a Bio-Spin 6 column. The
purified fragments were then ligated with plasmid pPT0317, which had been digested
with BanI REN and then passed through a Probind filter and a Bio-Spin 6 column. The
DNA was then treated with Shrimp Alkaline Phosphatase (SAP), passed through a
Probind filter and then a Bio-Spin 6 column (see Example 1).

The product of these ligation reactions was transformed into E. coli strain
HB101. Transformants were selected for resistance to kanamycin. Plasmid DNA from
individual transformants was purified and analyzed for increased size due to multiple
insertions of the SELP0K-CS1 DNA. Several clones were obtained. Plasmid pPT0370 was
chosen to be used for expression of SELP0K-CS1.

SELP0K-CS1 Expression Analysis

E. coli strain HB101 containing plasmid pPT0370 was grown as described in
Example 1. The proteins produced by these cells were analyzed by SDS-PAGE for
detection of reactivity to ELP antibodies. In every analysis a strong reactive band was
observed with an apparent molecular weight of approximately 90 kD.

pPT0370 SELP0K-CS1 934 AA MW 76,389

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAGAGS)2 (GVGVP)1 LGPLGP (GVGVP)3 GKGVP (GVGVP)3]15
(GAGAGS)2 GAGAMDPGRYQDLRSHHHHHH
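
The block notation used for the polymer can be expanded into an explicit sequence to check the monomer length and its proline content. The sketch below assumes a single GVGVP unit before LGPLGP, as the Table 19 translation indicates; the names `BLOCK` and `expand` are illustrative.

```python
# Repeat block of SELP0K-CS1 as (unit, count) pairs.
# ASSUMPTION: one GVGVP unit precedes LGPLGP, as read from Table 19.
BLOCK = [
    ("GAGAGS", 2),
    ("GVGVP", 1),
    ("LGPLGP", 1),
    ("GVGVP", 3),
    ("GKGVP", 1),
    ("GVGVP", 3),
]
REPEATS = 15  # number of blocks in the pPT0370 product as written


def expand(block):
    """Concatenate each unit the stated number of times."""
    return "".join(unit * count for unit, count in block)


monomer = expand(BLOCK)
repeat_region = monomer * REPEATS
monomer_bp = 3 * len(monomer)
proline_percent = 100.0 * monomer.count("P") / len(monomer)

if __name__ == "__main__":
    print(len(monomer), monomer_bp, len(repeat_region))
    print(round(proline_percent, 1))
```

On this reading the block is 58 residues long, which corresponds to the 174 bp monomer, and about 17% of its residues are proline, within the 10 to 45 number % range recited in the claims.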

As is evident from the above results, highly repetitive sequences can be
prepared, cloned, and used for expression to produce a wide variety of products which
may mimic natural products, such as silk and other proteins and antigens. In addition,
novel systems are provided for controlling the expression of the peptide under inducible
conditions in a variety of hosts. In this manner, new proteinaceous products can be
provided which have new properties or closely mimic the properties of
naturally occurring products.

All publications and patent applications mentioned in this specification are
indicative of the level of skill of those skilled in the art to which this invention pertains.
All publications and patent applications are herein incorporated by reference to the same
extent as if each individual publication or patent application was specifically and
individually indicated to be incorporated by reference.

The invention now being fully described, it will be apparent to one of ordinary 
skill in the art that many changes and modifications can be made thereto without 
departing from the spirit or scope of the appended claims.

WHAT IS CLAIMED IS:

1. A method of preparing a synthetic DNA sequence having repeating units of
from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing at least two different pairs of single stranded oligomers
wherein each of the oligomers of a pair overlap except as to any protruding ends;

(2) hybridizing each pair of single stranded oligomers to provide double
stranded segments;

(3) combining said segments or amplified copies thereof in a cloning vector
to form a monomer, where the combined segments are in reading frame;

(4) excising said monomer from said cloning vector by restriction enzyme
digestion; and

(5) oligomerizing said monomer to provide a multimer comprising at least
two monomers.

2. A method according to Claim 1, wherein at least one said restriction 
enzyme cuts at an asymmetric consensus sequence or at a site distal from the consensus 
sequence. 

3. A method according to Claim 1, wherein said monomer has protruding
termini which are complementary to each other.

4. A method according to Claim 1, wherein said pairs of oligomers include
at least two different pairs encoding the same amino acid sequence.

5. A method according to Claim 1, wherein said pairs of oligomers include
at least two different pairs encoding different amino acid sequences.

6. A method according to Claim 1, wherein the number of pairs of
oligomers is in the range of 2 to 4 and at least one oligomer has protruding ends on the
same strand.

7. A method according to Claim 1, wherein said repeating units have from
4 to 12 codons.

8. A method according to Claim 1, wherein at least a portion of said 
monomer is sequenced prior to oligomerizing to provide said multimer. 

9. A method of preparing a synthetic DNA sequence having repeating units
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing at least three different pairs of single stranded oligomers
wherein each of the oligomers of a pair overlap except as to any protruding ends;

(2) hybridizing each pair of single stranded oligomers to provide double
stranded segments each having from 21 to 90 bases to provide at least three segments,
each segment having a different nucleic acid sequence and having complementary ends
to contiguous segments;

(3) combining said segments or amplified copies thereof in a cloning vector
to form a monomer, where the combined segments are in reading frame;

(4) excising said monomer from said cloning vector by restriction enzyme
digestion; and

(5) oligomerizing said monomer to provide a multimer comprising at least
two monomers.

10. A method according to Claim 9, including the additional step of
analyzing said monomer prior to said oligomerizing.

11. A method according to Claim 9, wherein at least one repeating unit
encodes an amino acid sequence consisting of:

Gαβ, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG,

wherein α and β are any amino acid, α and β being selected so that the encoded protein
comprises from about 10 to 45 number % proline.

12. A method of preparing a DNA sequence having repeating units of from
about 3 to 15 codons and encoding a protein of at least about 30 kDal, said method
comprising:

(1) synthesizing at least three different pairs of single stranded oligomers of
from about 30 to 100 bases, wherein each of the oligomers of a pair overlap except as
to any protruding ends;

(2) hybridizing said pairs of oligomers to provide segments;

(3) inserting a first segment into a linearized cloning vector;

(4) sequencing said first segment to ensure the fidelity of said sequence;

(5) sequentially linearizing said vector with a restriction enzyme cleaving
proximal to a terminus of said segment, and adding additional segments at a terminus
of the prior segment, by digesting said cloning vector with a restriction enzyme cutting
at a site proximal to a terminus of the preceding segment, inserting each successive
segment in reading frame with the prior segment, and cloning the cloning vector
comprising said next segment, to provide a monomer;

(6) excising said monomer from said cloning vector;

(7) oligomerizing said monomer to provide at least one multimer comprising
at least two monomers;

wherein the sequences of said segments and vector are selected to permit
insertion of said segments and excision of said monomer by restriction enzyme digestion
with restriction enzymes which cleave at asymmetric consensus sequences or distal from
the consensus sequence.

13. A method according to Claim 12, including the additional step of 
analyzing said monomer prior to said oligomerizing. 

14. A method according to Claim 12, wherein at least one repeating unit
encodes an amino acid sequence consisting of:

Gαβ, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG,

wherein α and β are any amino acid, α and β being selected so that the encoded protein
comprises from about 10 to 45 number % proline.

15. A method of preparing a synthetic DNA sequence having repeating units
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing at least two different pairs of single stranded oligomers
wherein each of the oligomers of a pair overlap except as to any protruding ends;

(2) hybridizing each pair of single stranded oligomers to provide double
stranded segments;

(3) cloning a first segment in a cloning vector and analyzing the cloned first
segment to determine the fidelity of the sequence, discarding any segment having an
erroneous sequence;

(4) either:

(a) adding each successive segment in reading frame to prior
segments to provide a monomer and determining the fidelity of the sequence of each
successive segment; or

(b) cloning each successive segment in a cloning vector and
analyzing each successive segment to determine the fidelity of the sequence and
combining said segments or amplified copies thereof in a cloning vector to form a
monomer, where the combined segments are in reading frame;

(5) excising said monomer from said cloning vector by restriction enzyme
digestion; and

(6) oligomerizing said monomer to provide a multimer comprising at least
two monomers.

16. A method according to Claim 15, including the additional step of 
analyzing said monomer prior to said oligomerizing. 

17. A method according to Claim 15, wherein at least one repeating unit
encodes an amino acid sequence consisting of:

Gαβ, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG,

wherein α and β are any amino acid, α and β being selected so that the encoded protein
comprises from about 10 to 45 number % proline.

18. A method of preparing a synthetic DNA sequence having repeating units
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing at least two different pairs of single stranded oligomers
wherein each of the oligomers of a pair overlap except as to any protruding ends;

(2) hybridizing each pair of single stranded oligomers to provide double
stranded segments;

(3) isolating at least one double stranded segment from a previously
synthesized monomer;

(4) combining said segments or amplified copies thereof in a cloning vector
to form a monomer, where the combined segments are in reading frame;

(5) excising said monomer from said cloning vector by restriction enzyme
digestion; and

(6) oligomerizing said monomer to provide a multimer comprising at least
two monomers;

wherein synthesized segments are sequenced to ensure the fidelity of replication.

19. A method according to Claim 18, including the additional step of 
analyzing said monomer prior to said oligomerizing. 

20. A method according to Claim 18, wherein at least one repeating unit
encodes an amino acid sequence consisting of:

Gαβ, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG,

wherein α and β are any amino acid, α and β being selected so that the encoded protein
comprises from about 10 to 45 number % proline.

21. A method of preparing a synthetic DNA sequence having repeating units
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing a single strand encoding a monomer comprising from about
100 to 300 bases comprising repeating units of from about 3 to 15 codons flanked by
primer binding sites;

(2) preparing a complementary strand and amplifying the resultant dsDNA
monomer by polymerase chain reaction using primers of from 15 to 50 nt having a
sequence comprising a sequence which does not hybridize with the repetitive portion
of the sequence of said single strand and further comprising none or a portion of the
consensus sequence of a restriction site, the single strand completing the consensus
sequence where the primer comprises a portion of the restriction site, wherein said
primers are characterized by being different for each end of the dsDNA, having at least
40% GC, a Tm of at least 85 °C when hybridizing with the complementary primer and
the Tm differing between the two primers by not more than 2 °C;

(3) cloning said dsDNA monomer and selecting for a monomer having the
correct sequence; and

(4) oligomerizing said monomer to provide at least one multimer comprising
at least two monomers.

22. A method according to Claim 21, wherein at least one repeating unit
encodes an amino acid sequence consisting of:

Gαβ, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG,

wherein α and β are any amino acid, α and β being selected so that the encoded
protein comprises from about 10 to 45 number % proline.

23. A method according to Claim 21, comprising the further steps of
removing said primers with a restriction enzyme resulting in protruding ends and
excising said dsDNA monomer after cloning with a restriction enzyme resulting in
protruding ends.

24. A method of preparing a synthetic DNA sequence having repeating units
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing at least two different pairs of single stranded oligomers
wherein each of the oligomers of a pair overlap except as to any protruding ends;

(2) hybridizing each pair of single stranded oligomers to provide double
stranded segments;

(3) combining said segments or amplified copies thereof in a cloning vector
to form a monomer, where the combined segments are in reading frame;

(4) excising said monomer from said cloning vector by restriction enzyme
digestion;

(5) oligomerizing said monomer to provide a multimer comprising at least
two monomers;

wherein the sequences of said segments and vector are selected to permit
insertion of said segments and excision of said monomer by restriction enzyme
digestion;

(6) inserting said multimer in an expression vector functional for expression
in an expression host;

(7) introducing said expression vector into said expression host; and

growing said expression host, whereby said protein polymer is expressed.

25. A method according to Claim 24, including the additional step of
purifying said expressed protein polymer.

26. A method according to Claim 24, wherein at least one repeating unit encodes
an amino acid sequence consisting of:

Gαβ, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG,

wherein α and β are any amino acid, α and β being selected so that the encoded protein
comprises from about 10 to 45 number % proline.

27. A method according to Claim 24, wherein said expression host is E.
coli.

28. A method of preparing a synthetic DNA sequence having repeating units
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing a single strand encoding a monomer comprising from about
100 to 300 bases comprising repeating units of from about 3 to 15 codons flanked by
primer binding sites;

(2) preparing a complementary strand and amplifying the resultant dsDNA
monomer by polymerase chain reaction using primers of from 20 to 45 nt having a
sequence wherein said primer does not hybridize with the repetitive portion of the
sequence of said single strand and comprising none or a portion of a restriction site
proximal to the terminus of said primer, wherein said primers are characterized by
being different for each end of the dsDNA, having at least 40% GC, a Tm of at least
85 °C with the Tm between the two primers differing by not more than 2 °C;

(3) cloning said dsDNA monomer and selecting for a monomer having the
correct sequence;

(4) oligomerizing said monomer to provide at least one multimer comprising
at least two monomers;

(5) inserting said multimer in an expression vector functional for expression
in an expression host;

(6) introducing said expression vector into said expression host; and

growing said expression host, whereby said protein polymer is expressed.

29. A method according to Claim 28, including the additional step of
purifying said expressed protein polymer.

30. A method according to Claim 28, wherein at least one repeating unit encodes
an amino acid sequence consisting of:

Gαβ, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG,

wherein α and β are any amino acid, α and β being selected so that the encoded protein
comprises from about 10 to 45 number % proline.

31. A method according to Claim 28, wherein said expression host is E.
coli.

32. A method of preparing a synthetic DNA sequence having repeating units
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said
method comprising:

(1) synthesizing a single strand encoding a first segment of a monomer,
wherein said monomer comprises from about 100 to 300 bases comprising repeating
units of from about 3 to 15 codons;

(2) preparing a complementary strand and amplifying the resultant first
dsDNA segment by polymerase chain reaction using primers of from 15 to 50 nt having
a sequence comprising a sequence which does not hybridize with the repetitive portion
of the sequence of said single strand and further comprising none or a portion of the
consensus sequence of a restriction site, the single strand completing the consensus
sequence where the primer comprises a portion of the restriction site, wherein said
primers are characterized by being different for each end of the dsDNA, having at least
40% GC, a Tm of at least 85 °C when hybridizing with the complementary primer and
the Tm differing between the two primers by not more than 2 °C;

(3) cloning said first dsDNA segment into a first cloning vector and
selecting for a second double stranded DNA segment having the correct sequence;

(4) excising said second double stranded segment from said first cloning
vector by restriction enzyme digestion;

(5) isolating a third double stranded segment from a previously synthesized
monomer;

(6) combining said second and third double stranded segments or amplified
copies thereof in a second cloning vector to form a monomer, wherein the combined
segments are in reading frame and wherein said monomer comprises from about 100
to 300 bases comprising repeating units of from about 3 to 15 codons;

(7) excising said monomer from said second cloning vector by restriction
enzyme digestion; and

(8) oligomerizing said monomer to provide a multimer comprising at least
two monomers.

33. The method according to Claim 32 wherein said previously synthesized
monomer is prepared according to the method comprising:

(1) synthesizing at least two different pairs of single stranded oligomers
wherein each of the oligomers of a pair overlap except as to any protruding ends;

(2) hybridizing each pair of single stranded oligomers to provide double
stranded segments;

(3) combining said segments or amplified copies thereof in a cloning vector
to form a monomer, where the combined segments are in reading frame;

(4) excising said monomer from said cloning vector by restriction enzyme
digestion; and

(5) oligomerizing said monomer to provide a multimer comprising at least
two monomers.

34. The method according to Claim 32 wherein said previously synthesized
monomer is prepared according to the method comprising:

(1) synthesizing a single strand encoding a monomer comprising from about
100 to 300 bases comprising repeating units of from about 3 to 15 codons flanked by
primer binding sites;

(2) preparing a complementary strand and amplifying the resultant dsDNA
monomer by polymerase chain reaction using primers of from 15 to 50 nt having a
sequence comprising a sequence which does not hybridize with the repetitive portion
of the sequence of said single strand and further comprising none or a portion of the
consensus sequence of a restriction site, the single strand completing the consensus
sequence where the primer comprises a portion of the restriction site, wherein said
primers are characterized by being different for each end of the dsDNA, having at least
40% GC, a Tm of at least 85 °C when hybridizing with the complementary primer and
the Tm differing between the two primers by not more than 2 °C;

(3) cloning said dsDNA monomer and selecting for a monomer having the
correct sequence; and

(4) oligomerizing said monomer to provide at least one multimer comprising
at least two monomers.
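
The primer constraints recited in the PCR-based claims (length of 15 to 50 nt, at least 40% GC, a Tm of at least 85 °C, and Tm values within 2 °C of each other) can be expressed as a simple screening function. The sketch below is illustrative only. The Tm estimate uses a basic length-corrected formula, Tm = 81.5 + 0.41(%GC) - 675/N, which is an assumption of this example and is not given in the specification, and the primer sequences are hypothetical placeholders rather than the primers actually used.

```python
def gc_percent(primer):
    """Percentage of G and C bases in the primer."""
    primer = primer.upper()
    return 100.0 * sum(primer.count(b) for b in "GC") / len(primer)


def tm_estimate(primer):
    """Rough melting temperature in degrees C.

    ASSUMPTION: basic length-corrected formula; the specification does not
    state how Tm is to be calculated.
    """
    return 81.5 + 0.41 * gc_percent(primer) - 675.0 / len(primer)


def primer_pair_ok(fwd, rev, min_len=15, max_len=50,
                   min_gc=40.0, min_tm=85.0, max_tm_gap=2.0):
    """Check a primer pair against the length, GC and Tm limits."""
    if fwd.upper() == rev.upper():
        return False  # the two primers must differ
    for primer in (fwd, rev):
        if not min_len <= len(primer) <= max_len:
            return False
        if gc_percent(primer) < min_gc:
            return False
        if tm_estimate(primer) < min_tm:
            return False
    return abs(tm_estimate(fwd) - tm_estimate(rev)) <= max_tm_gap


# Hypothetical placeholder primers, chosen only to exercise the checks.
FWD = "GCGCA" * 8
REV = "CGCGT" * 8
LOW_GC = "ATATG" * 8

if __name__ == "__main__":
    print(primer_pair_ok(FWD, REV))
    print(primer_pair_ok(FWD, LOW_GC))
```

A real design would also have to confirm that neither primer hybridizes with the repetitive portion of the template, which this sketch does not attempt.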